Workflow relationship management and contextualization

ABSTRACT

Described herein are methods and system for electronic workflow management having a central server that periodically scans data accessible to multiple computers and data interacted with by different users to generate a nodal data structure comprising of interrelated nodes where each node corresponds to a workflow component, such as files, messages, tasks, and the like. The server then executes various analytical protocols to identify and link/merge nodes corresponding to related content. The server then allows a user to customize graphical user interfaces where the server arranges the content of each graphical user interface based on their corresponding relationships within the nodal data structure.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part application of U.S. patentapplication Ser. No. 16/164,465, filed Oct. 18, 2018, which claimspriority to U.S. Provisional Application No. 62/573,914, filed Oct. 18,2017, each of which is hereby incorporated by reference in its entirety.

This application claims priority to U.S. Provisional Patent ApplicationNo. 62/854,682, filed May 30, 2019, which is incorporated by referencein its entirety.

TECHNICAL FIELD

This application relates generally to electronic data management anddata mapping.

BACKGROUND

Productivity tools utilize software to track, manage, and/or storefiles, contacts, messages, comments, tasks, and more within differentcomputing environments. Most conventional productivity tools systems arecapable of keeping records of various versions, edits, collaborations,and/or related data (files, tasks, contacts, messages, comments, etc.)created or accessed by different users. Utilizing conventionalproductivity software systems, users can access and interact withvarious Components of Work, such as files, messages, tasks, etc. withindifferent computing environments. Users can also monitor a revision/edithistory of various Components of Work. However, conventionalproductivity systems suffer from at least two technical problems.

First, conventional productivity systems require users to store allfiles/data on a central electronic data repository (e.g., database,cloud storage, and the like). Storing large volumes of data on a centraldata repository requires high amount of data storage, which is costly,inefficient, and/or even impractical when working with clients orcollaborators that prefer using repositories from competing providers.For instance, managing interrelated Components of Work in a centralrepository requires significant processing power due to number, size,content, and relationships between the Components of Work. Onlinecollaborative applications are increasingly emerging where multipleusers can simultaneously access, store, share, edit, comment on, andcreate tasks for online files. Many of these applications come withtheir own file types, their own file storage, their own messaging orcommenting features, their own project management, contact lists, andeven their own file browser. Ultimately, many people today findthemselves in a digital reality defined by data/content that is highlyfragmented. As a result, conventional productivity software systemseither require high processing power, which is costly, or do not monitorthe files in a timely manner, which is highly undesirable andinefficient.

Second, conventional productivity software systems may require users todesignate related Components of Work. For instance, conventionalworkflow management systems require users to either tag relatedfiles/messages/tasks, designate a related project to afile/message/task, or name files/tasks/contacts in accordance withpredetermined naming requirements. Therefore, conventional productivitysoftware systems shift the burden of identifying relationships betweencomponents/units of work (e.g., files, messages, tasks, contacts) ontousers, which is highly undesirable and creates a negative userexperience.

SUMMARY

For the aforementioned reasons, there is a need for a workflowmanagement system and method that can automatically and efficientlyidentify related Components of Work, collect data surrounding therelated Components of Work, and display the results in real-time or nearreal-time. There is a need for a workflow management system that doesnot require users to manually identify related files, communication,projects, or other context. Furthermore, there is a need for a workflowmanagement system that does not require users to store all their data ina single central electronic data repository.

Even though certain embodiments herein are described in the context offiles, it is expressly understood that the methods and systems describedherein apply to any components of work. For instance, while methods,systems, and embodiments disclosed herein describe analytics server (orany other server or computing device) identifying whether two files arerelated, the same servers or other computing devices can use thedisclosed methods, systems, and embodiments to identify relatedcomponents of work. Non-limiting examples of a Component of Work mayinclude messages, tasks, contacts, deals, clients, employees, contacts,notifications, and the like.

In an embodiment, a method comprises periodically scanning, by a server,a plurality of electronic data repositories accessible to a plurality ofcomputing devices to identify a plurality of files stored onto theplurality of electronic data repositories where each file is accessibleto at least one computing device within the plurality of computingdevices; executing, by the server, a protocol to generate at least oneunique identifier of each file within the plurality of files;retrieving, by the server for each file, context data comprising atleast one of a time stamp, access history, and edit history associatedwith each identified electronic file; generating, by the server, anartificial intelligence model comprising a set of nodes where each nodecomprises metadata indicating a unique identifier and context data foreach respective electronic file; executing the artificial intelligencemodel to identify a likelihood of reliance value for each pair ofelectronic files within the identified plurality of electronic filed,wherein each likelihood of relevance value correspondents to alikelihood that two electronic files are related to each other; for eachpair of electronic files having a likelihood of relevance value thatsatisfy a threshold: merging, by the server, respective nodescorresponding to the pair of electronic files; in response to receiving,from an electronic client device, a request to access a file,identifying, by the server, a node corresponding to the requested file;and displaying, by the server, an indication of the file and any otherfiles with a related node to the identified node of the requested file.

In an embodiment, a method comprises periodically scanning, by a server,a plurality of electronic data repositories accessible to a plurality ofcomputing devices to identify a plurality of files stored onto theplurality of electronic data repositories where each file is accessibleto at least one computing device within the plurality of computingdevices; executing, by the server, a protocol to generate at least oneunique identifier of each file within the plurality of files;retrieving, by the server for each file, context data comprising atleast one of a time stamp and access history; executing, by the server,a computer model to identify related files based on each file's contextdata and unique identifier to generate a plurality of groups where eachgroup comprises at least one electronic file; monitoring, by the server,electronic communication between a set of users to identify a set ofelectronic communication events between at least two users, where eachelectronic communication is associated with at least one group of files;identifying, by the server, context data associated with each identifiedelectronic communication event, the context data comprising at least atime stamp of each electronic communication event and an electronic fileand its corresponding group; generating, by the server, a score for eachuser within the set of users, the score corresponding to the identifiedcontext data of the identified electronic communication events; andgenerating, by the server, a graphical user interface having a set ofgraphical component each representing at least one user within the setof users, wherein the set of graphical components are arranged inaccordance with the score of each user represented by each respectivegraphical component.

In another embodiment, context data is associated with at least oneelectronic communication event comprises a text of that electroniccommunication event. In another embodiment, the score for each userfurther corresponds to the text of at least one electronic communicationevent. In another embodiment, the server filters or sorts the set ofgraphical elements within the graphical user interface based on anattribute. In another embodiment, the server displays a list ofattributes on the graphical user interface. In another embodiment, theattribute corresponds to a department, office, role, team, project, orskill associated with at least one user. In another embodiment, when auser interacts with at least one graphical component representing atleast one user, the server displays a profile of that user. In anotherembodiment, the server displays at least one group of files associatedwith the user. In another embodiment, the server displays at least oneapplication associated with the user. In another embodiment, the serverdisplays at least one other user associated with the selected user.

In another embodiment, a method comprises periodically scanning, by aserver, a plurality of electronic data repositories accessible to aplurality of computing devices to identify a plurality of files storedonto the plurality of electronic data repositories where each file isaccessible to at least one computing device within the plurality ofcomputing devices; executing, by the server, a protocol to generate atleast one unique identifier of each file within the plurality of files;retrieving, by the server for each file, context data comprising atleast one of a time stamp and access history; executing, by the server,a computer model to identify related files based on each file's contextdata or unique identifier; generating, by the server, a nodal datastructure comprising a set of nodes where each node within at least asubset of the set of nodes corresponds to a file within the set offiles, each node within the set of nodes linked to at least one othernode; in response to receiving a time period input from a user,identifying, by the server, a cluster of linked nodes associated withthe user during the time period inputted; and dynamically displaying, bythe server, a graphical element corresponding to the identified clusterof linked nodes.

In another embodiment, a system comprises a server in communication withat least one electronic data repository and a plurality of electronicdevices each configured to access at least one file stored onto the atleast one electronic data repository, the server comprising a processorand a non-transitory computer-readable medium containing instructionsthat when executed by the processor causes the processor to performoperations comprising periodically scan the plurality of electronic datarepositories accessible to the plurality of computing devices toidentify a plurality of files stored onto the plurality of electronicdata repositories where each file is accessible to at least onecomputing device within the plurality of computing devices; execute aprotocol to generate at least one unique identifier of each file withinthe plurality of files; retrieve for each file, context data comprisingat least one of a time stamp and access history; execute a computermodel to identify related files based on each file's context data orunique identifier; generate a nodal data structure comprising a set ofnodes where each node within at least a subset of the set of nodescorresponds to a file within the set of files, each node within the setof nodes linked to at least one other node; in response to receiving atime period input from a user, identify a cluster of linked nodesassociated with the user during the time period inputted; anddynamically display a graphical element corresponding to the identifiedcluster of linked nodes.

In another embodiment, a method comprises periodically scanning, by aserver, a plurality of electronic data repositories accessible to aplurality of computing devices to identify a plurality of files storedonto the plurality of electronic data repositories where each file isaccessible to at least one computing device within the plurality ofcomputing devices; executing, by the server, a protocol to generate atleast one unique identifier of each file within the plurality of files;retrieving, by the server for each file, context data comprising atleast one of a time stamp and access history; executing, by the server,a computer model to identify related files based on each file's contextdata or unique identifier; generating, by the server, a nodal datastructure comprising a set of nodes where each node within at least asubset of the set of nodes corresponds to a file within the set offiles, each node within the set of nodes linked to at least one othernode; receiving, by the server, a prioritization attribute associatefrom a user device; receiving, by the server, a set of notificationsassociated with the user device; identifying, by the server within thenodal data structure, a node associated with each notification;generating, by the server, a score for each notification, the scorecorresponding to the prioritization attribute and the identified nodeassociated with each respective node; and outputting, by the server, theset of notifications in an order corresponding to each notification'srespective score.

In another embodiment, a server in communication with at least oneelectronic data repository and a plurality of electronic devices eachconfigured to access at least one file stored onto the at least oneelectronic data repository, the server comprising a processor and anon-transitory computer-readable medium containing instructions thatwhen executed by the processor causes the processor to performoperations comprising periodically scan the plurality of electronic datarepositories accessible to the plurality of computing devices toidentify a plurality of files stored onto the plurality of electronicdata repositories where each file is accessible to at least onecomputing device within the plurality of computing devices; execute aprotocol to generate at least one unique identifier of each file withinthe plurality of files; retrieve for each file, context data comprisingat least one of a time stamp and access history; execute a computermodel to identify related files based on each file's context data orunique identifier; generate a nodal data structure comprising a set ofnodes where each node within at least a subset of the set of nodescorresponds to a file within the set of files, each node within the setof nodes linked to at least one other node; receiving, by the server, aprioritization attribute associate from a user device; receiving, by theserver, a set of notifications associated with the user device;identifying, by the server within the nodal data structure, a nodeassociated with each notification; generating, by the server, a scorefor each notification, the score corresponding to the prioritizationattribute and the identified node associated with each respective node;and outputting, by the server, the set of notifications in an ordercorresponding to each notification's respective score.

In another embodiment, a method comprises periodically scanning, by aserver, a plurality of electronic data repositories accessible to aplurality of computing devices to identify a plurality of files storedonto the plurality of electronic data repositories where each file isaccessible to at least one computing device within the plurality ofcomputing devices; executing, by the server, a protocol to generate atleast one unique identifier of each file within the plurality of files;retrieving, by the server for each file, context data comprising atleast one of a time stamp and access history; executing, by the server,a computer model to identify related files based on each file's contextdata or unique identifier; generating, by the server, a nodal datastructure comprising a set of nodes where each node within at least asubset of the set of nodes corresponds to a file within the set offiles, each node within the set of nodes linked to at least one othernode; receiving, by the server, an attribute from a user; identifying,by the server within the nodal data structure, a cluster of relatednodes associated with the received attribute; and when the userinitiates a browser application, dynamically generating, by the server,a first link displayed by the browser application and configured todirect the browser application to one or more electronic filesassociated with identified cluster of related nodes.

In another embodiment, a system comprises a server in communication withat least one electronic data repository and a plurality of electronicdevices each configured to access at least one file stored onto the atleast one electronic data repository, the server comprising a processorand a non-transitory computer-readable medium containing instructionsthat when executed by the processor causes the processor to performoperations comprising: periodically scan a plurality of electronic datarepositories accessible to a plurality of computing devices to identifya plurality of files stored onto the plurality of electronic datarepositories where each file is accessible to at least one computingdevice within the plurality of computing devices; execute a protocol togenerate at least one unique identifier of each file within theplurality of files; retrieve, for each file, context data comprising atleast one of a time stamp and access history; execute a computer modelto identify related files based on each file's context data or uniqueidentifier; generate a nodal data structure comprising a set of nodeswhere each node within at least a subset of the set of nodes correspondsto a file within the set of files, each node within the set of nodeslinked to at least one other node; receive an attribute from a user;identify, within the nodal data structure, a cluster of related nodesassociated with the received attribute; and when the user initiates abrowser application, dynamically generate a first link displayed by thebrowser application and configured to direct the browser application toone or more electronic files associated with identified cluster ofrelated nodes.

BRIEF DESCRIPTION OF THE DRAWINGS

Non-limiting embodiments of the present disclosure are described by wayof example with reference to the accompanying figures, which areschematic and are not intended to be drawn to scale. Unless indicated asrepresenting the background art, the figures represent aspects of thedisclosure.

FIG. 1 illustrates components of an electronic workflow managementsystem, in accordance with an embodiment.

FIG. 2 is a flow diagram of a process executed in an electronic workflowmanagement system, in accordance with an embodiment.

FIG. 3 is a nodal data structure representing multiple identified fileswithin a computing environment, in accordance with an embodiment.

FIG. 4 is a nodal data structure representing multiple identified fileswithin a computing environment, in accordance with an embodiment.

FIG. 5 is a graphical user interface displaying file contextinformation, in accordance with an embodiment.

FIG. 6A is a graphical user interface displaying a file and its contextinformation, in accordance with an embodiment.

FIG. 6B is a graphical user interface displaying a file and its contextinformation, including a related email message, in accordance with anembodiment.

FIG. 6C is a graphical user interface displaying a folder and itscontext information, including related tasks, in accordance with anembodiment.

FIGS. 7A and 7B illustrate a messaging application, in accordance withan embodiment.

FIG. 8 is a flow diagram of a process executed in an electronic workflowmanagement system, in accordance with an embodiment.

FIG. 9 is a visual representation of a nodal data structure, inaccordance with an embodiment.

FIG. 10 is a visual representation of a nodal data structure, inaccordance with an embodiment.

FIG. 11 is a flow diagram of a process executed in an electronicworkflow management system, in accordance with an embodiment.

FIG. 12 is a visual representation of a nodal data structure, inaccordance with an embodiment.

FIG. 13A is a flow diagram of a process executed in an electronicworkflow management system, in accordance with an embodiment.

FIG. 13B illustrates a nodal data structural corresponding to anorganization's hierarchy, in accordance with an embodiment.

FIGS. 13C-D illustrate graphical user interfaces displaying anorganization's hierarchy, in accordance with an embodiment.

FIG. 14 is a flow diagram of a process executed in an electronicworkflow management system, in accordance with an embodiment.

FIGS. 15A-D illustrate graphical user interfaces displaying an enhancedtimesheet application, in accordance with an embodiment.

FIGS. 16A and 16B illustrate a flow diagram of a process executed in anelectronic workflow management system, in accordance with an embodiment.

FIG. 17A illustrates a schematic representation of prioritizednotification system provided by an electronic workflow managementsystem, in accordance with an embodiment.

FIG. 17B illustrates a flow diagram of a process executed in anelectronic workflow management system, in accordance with an embodiment.

FIG. 17C illustrates graphical user interfaces displaying prioritizednotification, in accordance with an embodiment.

FIG. 18A illustrates a flow diagram of a process executed in anelectronic workflow management system, in accordance with an embodiment.

FIGS. 18B-M illustrate graphical user interfaces displayed by anelectronic workflow management system, in accordance with an embodiment.

FIGS. 19-21 illustrate graphical user interfaces displayed by anelectronic workflow management software, in accordance with anembodiment.

FIGS. 22A to 25B show non-limiting examples of workflow managementsystems, in accordance with an embodiment.

FIGS. 26-27 illustrate graphical user interfaces displayed by anelectronic workflow management system, in accordance with an embodiment.

FIG. 28 illustrates a graphical user interface displayed by anelectronic workflow management system, in accordance with an embodiment.

FIG. 29 illustrates a graphical user interface displayed by anelectronic workflow management system, in accordance with an embodiment.

FIG. 30 illustrates graphical user interfaces displayed by an electronicworkflow management system, in accordance with an embodiment.

DETAILED DESCRIPTION

Reference will now be made to the illustrative embodiments depicted inthe drawings, and specific language will be used here to describe thesame. It will nevertheless be understood that no limitation of the scopeof the claims or this disclosure is thereby intended. Alterations andfurther modifications of the inventive features illustrated herein, andadditional applications of the principles of the subject matterillustrated herein, which would occur to one skilled in the relevant artand having possession of this disclosure, are to be considered withinthe scope of the subject matter disclosed herein. Other embodiments maybe used and/or other changes may be made without departing from thespirit or scope of the present disclosure. The illustrative embodimentsdescribed in the detailed description are not meant to be limiting ofthe subject matter presented.

FIG. 1 illustrates components of an electronic workflow managementsystem 100. The electronic workflow management system 100 may also bereferred to herein at the electronic workflow management system. Theelectronic workflow management system 100 may include an analyticsserver 110, an administrator computing device 120, user computingdevices 140 a-e (collectively user computing devices 140), electronicdata repositories 150 a-d (collectively electronic data repositories150), and third-party server 160. The above-mentioned components may beconnected to each other through a network 130. The examples of thenetwork 130 may include, but are not limited to, private or public LAN,WLAN, MAN, WAN, and the Internet. The network 130 may include both wiredand wireless communications according to one or more standards and/orvia one or more transport mediums.

The communication over the network 130 may be performed in accordancewith various communication protocols such as Transmission ControlProtocol and Internet Protocol (TCP/IP), User Datagram Protocol (UDP),and IEEE communication protocols. In one example, the network 130 mayinclude wireless communications according to Bluetooth specificationsets, or another standard or proprietary wireless communicationprotocol. In another example, the network 130 may also includecommunications over a cellular network, including, e.g., a GSM (GlobalSystem for Mobile Communications), CDMA (Code Division Multiple Access),EDGE (Enhanced Data for Global Evolution) network.

The electronic workflow management system 100 is not confined to thecomponents described herein and may include additional or alternatecomponents, not shown for brevity, which are to be considered within thescope of the electronic workflow management system 100.

The analytics server 110 may generate and display a graphical userinterface (GUI) on each user computing devices 140 within a network 180.The analytics server 110 may also display the GUI on theadministrator-computing device 120. An example of the GUI generated andhosted by the analytics server 110 may be a web-based application or awebsite, as depicted in FIGS. 5, 6A-C, 7A-B, 15, and 19-29.

The analytics server 110 may host a website accessible to end-users,where the content presented via the various webpages may be controlledbased upon each particular user's role. The analytics server 110 may beany computing device comprising a processor and non-transitorymachine-readable storage capable of executing the various tasks andprocesses described herein. Non-limiting examples of such computingdevices may include workstation computers, laptop computers, servercomputers, cell phones, and the like. While the electronic workflowmanagement system 100 includes a single analytics server 110, in someconfigurations, the analytics server 110 may include any number ofcomputing devices operating in a distributed computing environment toachieve the functionalities described herein.

The analytics server 110 may execute software applications configured todisplay the GUI (e.g., host a website), which may generate and servevarious webpages to each user computing device 140 and/or theadministrator computing device 120. Different users operating the usercomputing devices 140 may use the website to generate, access, and storedata (e.g., files) stored on one or more of the electronic datarepositories 150, as described in detail in FIGS. 2, 5, 6, and 7. Insome implementations, the analytics server 110 may be configured torequire user authentication based upon a set of user authorizationcredentials (e.g., username, password, biometrics, cryptographiccertificate, and the like). In such implementations, the analyticsserver 110 may access a system database 150 d configured to store usercredentials, which the analytics server 110 may be configured toreference in order to determine whether a set of entered credentials(purportedly authenticating the user) match an appropriate set ofcredentials that identify and authenticate the user.

As described herein a file refers to contained data available to atleast one operating system and/or at least one software program. A filemay contain data, such as text, video, computer program, audio, and thelike. Furthermore, a file can also refer to a path associated with data.For example, a file, as used herein, can refer to a traditional file orfolder on a local machine, a shortcut to a file/folder on a differentmachine, and/or a reference to a file/folder in an email message.Another non-limiting example of a file may include a reference to thelocation of a file/folder by website URL or file/folder path, afile/folder that only exists online or is not traditionally saved to alocal machine's normal file. The path may not be accessible through themain system's file browser (e.g., Google Docs®, Evernote Notes®, and thelike) that are not typically accessible through a computer's WindowsExplorer or MacOS Finder unless explicitly downloaded to a folder in adifferent format that might lose either functionality or context such asrelated content and comments). In some configurations, the analyticsserver 110 may provide an application native to the user computingdevices 140 or other electronic devices used by users where users mayaccess the native application using the user computing devices 140 orany other computing devices (e.g., personal electronic devices) togenerate, access, store, or otherwise interact with data stored onto theelectronic data repositories 150. The native application may be anyapplication that is directly in communication with the analytics server110. For example, the native application may be a mobile application,cloud-based application, universal GUI, and/or virtual/cloud-based“desktop” where users (upon being authenticated) can access, interactwith, and manipulate data stored onto the electronic data repositories150.

In some configurations, the analytics server 110 may generate and hostwebpages based upon a particular user's role within the electronicworkflow management system 100 (e.g., administrator, employee, or theemployer). In such implementations, the user's role may be defined bydata fields and input fields in user records stored in the systemdatabase 150 d. The analytics server 110 may authenticate each user andmay identify the user's role by executing an access directory protocol(e.g., LDAP). The analytics server 110 may generate webpage content,access or generate data stored in the electronic data repositories 150,according to the user's role defined by the user record in the systemdatabase 150 d. For instance, a user may be defined as a lower levelemployee who may not be authorized to view all related content to aparticular sensitive file. Therefore, the analytics server 110 maycustomize the GUI according to the user's authentication level.Furthermore, the analytics server 110 may customize the GUI according toa user's role (e.g., function type). For instance, the analytics server110 may customize the GUI based on whether a user is a designer or anaccount manager.

In operation, when instructed by the administrator-computing device 120and/or any user-computing device 140, the analytics server 110 mayexecute various scanning and crawling protocols to identify and map datastored onto each electronic data repository 150. As described herein,the analytics server 110 may also execute various predeterminedprotocols to generate unique identifiers for the above-describedfiles/data, identify related files, create a nodal data structure,periodically scan the electronic data repositories, update the nodaldata structure, and display related files and context information on theabove-described GUI. In some implementations, the analytics server 110may incorporate the GUI into a third-party application, such as athird-party email application or a file sharing/management applicationwhile preserving the “look and feel” of the third-party application.

In some configurations, the analytics server 110 may compare uniqueidentifiers included in the metadata of each file. For instance, a filemay have metadata that includes unique identifiers associated withelements related to the file (e.g., email, tasks, storage location, andthe like). In some embodiments, the analytics server 110 may use theseunique identifiers to determine whether the file is related to any otherfiles.

User computing devices 140 may be any computing device comprising aprocessor and a non-transitory machine-readable storage medium capableof performing the various tasks and processes described herein.Non-limiting examples of a user-computing device 140 may be aworkstation computer, laptop computer, tablet computer, and servercomputer. As depicted in FIG. 1, the user computing devices 140 may eachbe operated by a user within the network 180. In a non-limiting example,the network 180 represents an internal network and/or collection ofcomputing devices connected within an entity. For instance, network 180may represent all computing devices operated by all employees of acompany. User computing devices 140 may be internally interconnected viaan internal and/or private network of the network 180 (not shown). Forinstance, a company's intranet or any other private network may connectall the company's computing devices. In FIG. 1, user-computing devices140 are interconnected within the network 180 (e.g., belong to the samecompany).

Even though the depicted user computing devices 140 are within the samenetwork (e.g., network 180), it is expressly understood that theservices provided by the analytics server 110 may not be limited tocomputers within the same network. For instance, the analytics server110 may scan files accessible to one or more user computing devices thatare not interconnected and are not within the same network. In someother embodiments, the analytics server 110 may only monitor acustomized and/or predetermined portion of the computing devices 140.For instance, the administrator-computing device 120 may customize alist of user computing device 140 and their corresponding electronicrepository 150 to be monitored by the analytics server 110.

Each user computing device 140 may access one or more electronic datarepositories 150 to access (e.g., view, delete, save, revise, share,send, communicate around, and the like) data stored onto the one or moreelectronic data repositories 150. For instance, user-computing device140 a may access data within a local database 150 a. User computingdevice 140 b and 140 c may access a shared database 150 b. Usercomputing device 140 d may access a cloud storage 150 c. Furthermore,user-computing device 140 e may access a database operationally managedby the analytics server 110, such as the system database 150 d. Thenetwork 180 may also include the third-party server 160 where one ormore user computing devices 140 utilize the third-party server 160 toaccess, store, and/or manage data. An example of the third-party server160 may be an email server, a third party (or homegrown) electronic filemanagement server, a public web site for hosting and sharing specificfile types (e.g., YouTube® for videos, Behance® for graphic files, andLinkedIn Slideshare® for presentations), or any other server used toaccess and/or store data files.

In some configurations, data accessible to the user computing devices140 may be stored in a distributed manner onto more than one electronicrepositories. For instance, one or more files may be stored onto ablockchain accessible to the user computing devices 140 where theblockchain comprises multiple distributed nodes storing data ontodisparate electronic repositories. The analytics sever 110 may retrievea public or private blockchain key associated with each user and/or eachuser computing device 140 to access the blockchain and monitor datastored onto the blockchain.

Even though different user computing devices 140 are depicted as having,access to different electronic data repositories 150, it is expresslyunderstood that in different embodiments and configurations, one or moreuser computing devices 140 may have access to a combination of differentelectronic repositories 150. For instance, user-computing device 140 amay utilize the third-party server 160 and the local database 150 a tostore data. In another example, user-computing device 140 c may utilizedatabase 150 b, cloud storage 150 c and the third-party server 160 toaccess files/data. For the purpose of brevity, different combinations ofdifferent user computing devices 140 having access to differentelectronic data repositories 150 are not shown.

FIG. 2 illustrates a flow diagram of a process executed in an electronicworkflow management system, in accordance with an embodiment. The method200 includes steps 210-270. However, other embodiments may includeadditional or alternative execution steps, or may omit one or more stepsaltogether. In addition, the method 200 is described as being executedby a server, similar to the analytics server described in FIG. 1.However, in some embodiments, steps may be executed by any number ofcomputing devices operating in the distributed computing systemdescribed in FIG. 1. One or more user computing devices or anadministrator-computing device may, locally perform for instance, partor all the steps described in FIG. 2. Furthermore, even though someaspects of the method 200 is described in the context of a web-basedapplication, in other configurations, the analytics server may displayrelated data in a mobile application or an application native to theuser's desktop.

At step 210, the analytics server may periodically scan a plurality ofelectronic data repositories accessible to a plurality of computingdevices to identify a plurality of files stored onto the plurality ofelectronic data repositories where each file is accessible to at leastone computing device within the plurality of computing devices. Asdiscussed above, the analytics server may periodically scan one or moreelectronic data repositories to identify electronic files stored ontosuch electronic repositories.

The analytics server may require all users to create accounts and grantpermission to the analytics server to periodically monitor filesaccessible to each user and/or computing device operated by each user.In some configurations, the analytics server may provide a web-basedapplication displaying various prompts allowing each user to grant theanalytics server permission to periodically monitor all files accessibleand/or revised by each user. The web-based application may provide atleast five monitoring functionalities: 1) files saved on any electronicdata repository accessible by each user; 2) each user's emailcommunication; 3) each user's chat/messaging activity; 4) each user'stask management or project management; and 5) each user's calendarevents.

During the account registration process, the web-based application maydisplay one or more prompts allowing each user to connect his or heremail accounts, messaging tools, task management tools, projectmanagement tools, calendars, organizational or knowledge managementtools (e.g., Evernote®, Atlassian Confluence®, etc.), othercollaborative tools (e.g., Basecamp®, Smartsheet®, etc.) and/orelectronic repository systems (e.g., local database, cloud storagesystems, and the like) to the analytics server. The prompt may alsoinclude one or more text input fields where each user can inputidentification and authentication credentials for his email accounts,messaging tools, electronic repository systems, and/or third partyapplications, such as project management tool, time trackingapplications, billing, issue tracking, web accounts (e.g., YouTube®),online applications (e.g., Figma®, Onshape®, Google Docs®, and thelike). For example, a user may enter his email address and password inthe input fields displayed by the analytics server. Upon receipt, theanalytics server may use the authentication credentials to remotelylogin the above-described portals and monitor all files accessibleand/or revised by each user and/or all files saved on the electronicdata repositories.

Upon receiving permission and/or authorization from users, the analyticsserver may scan the one or more electronic data repositories accessibleto each user. The analytics server may execute a scanning or crawlingprotocol where the analytics server crawls different databases toidentify all files accessible to each user.

As discussed above, an electronic repository may represent anyelectronic repository storing files that are accessible to one or morecomputers within an entity or a network. Non-limiting examples of anelectronic repository may include a database, cloud storage system,third-party shared drives, third-party application as described above,internal file transfer protocol (FTP), and internal or external databaseoperated by the analytics server, email storage, HR systems, accountingsystems, customer relationship management (CRM) systems, and the like.

The analytics server may, upon receiving permission from one or morecomputing devices periodically scan the above-described electronicrepositories and identify one or more files stored onto these electronicrepositories. For instance, an administrator of an entity may grantpermission to the analytics server to scan all repositories accessibleto all computers within the entity.

Upon identification of each file, the analytics server may search dataassociated with the identified files and may re-create an activitytimeline for each user. The activity timeline may present historicaldata associated with each file and each user. For instance, when theanalytics server identifies a file (e.g., Sample.doc), the analyticsserver may further identify a history of Sample.doc by analyzing saidfile's history (e.g., revision, communication, and access history of thefile). As a result, the analytics server may create a timeline thatindicates every interaction (e.g., file generation, revisions,modification, and the like) with Sample.doc.

In some configurations, the analytics server may retrieve the filehistory and other related data (e.g., context data) using an applicationprogramming (API) interface in communication with the electronic datarepositories. For instance, the analytics server may be prohibited fromaccessing a third-party shared drive. In those embodiments, theanalytics server may use an API configured to communicate with the thirdparty shared drive to identify and monitor files. The analytics servermay further use a similar protocol to determine whether a file has beenrevised/modified. For instance, the analytics server may cause an API toconnect/sync with a third-party document sharing application. Theanalytics server may also cause the API to transmit a notification foreach instance that a file, stored on the third-party document sharingapplication, is accessed and/or revised by a user.

In some configurations, third-party service providers of shared documentdrives may not allow the API to transfer detailed data regarding filerevisions. For instance, third-party service providers may only transmita notification that a file has been accessed and/or revised by a user.However, the API notification may not contain the revision (e.g., changeof text, formatting, and the like) to the file. In those embodiments,the analytics server may remotely access the shared drive, usingcredentials obtained from the user during the account registrationprocess, obtain a copy of the file, and compare the file to a previousversion.

The analytics server may also include the API notification in themetadata profile of each identified file. For instance, the analyticsserver may receive an API notification that a first user has shared FileX with a second user on a third-party document sharing application. TheAPI notification may not include any specific data regarding the contentof File X because the analytics server may be prohibited from retrievinga copy of File X. The analytics server may include the document sharingactivity in the metadata of File X (within the nodal data structuredescribed herein), which may include a timestamp of the document shareand data associated with the first user and the second user. As aresult, the analytics server may reconstruct an activity timeline forFile X that includes information on how File X was shared (e.g., mediumand timestamp) and different users who interacted with File X.

In another example, user 1 may share File X with user 2 using athird-party file management application. Using an API connect to thethird-party file management application, the analytics server mayreceive a notification that File X was shared between two users at acertain time. The API notification may not include user identifiers andmay not identify the sender or the receiver of File X. The third-partyfile management application may also notify user 1 and/or user 2regarding the file sharing. For instance, the third-party filemanagement application may send an email to user 2 informing user 2 thatuser 1 has shared File X with user 2. The email may also include anidentifier associated with File X (e.g., URL of File X). Because theanalytics server has access to emails of user 1 and user 2, theanalytics server can identify that user 1 has shared File X with user 2.The analytics server may then include the file path, timestamp of theemail, and timestamp of the file share, in the File X's metadata file.In some configurations, the analytics server may create a node for theemail and/or the file path (e.g., URL) included in the email.

At step 220, the analytics server may execute a predetermined protocolto generate a unique identifier of each file within the plurality offiles. Upon identifying each file, the analytics server may execute apredetermined hashing algorithm and generate a unique identifier foreach file identified in step 210. A hashing algorithm or protocol may bea function that converts a data string into a numeric string output offixed lengths. The output string is generally much smaller than theoriginal data. Hash algorithms are designed to be collision-resistant.In other words, there is a very low probability that the same stringwould be created for files with dissimilar content. Simply put, theanalytics server may create a unique identifier for each file identifiedin step 210. As described herein, the analytics server may use theunique identifier (e.g., generated hash string) to compare files witheach other and/or with their previous versions.

Using the hashing algorithm, the analytics server may eliminate the needto retrieve files and to execute file-comparison protocols, whichrequires extensive computing power and increases the chance ofunintended file corruption. For instance, the analytics server may storethe unique identifier for each file in a database. Upon detecting arevision of a file, the analytics server may execute the same hashingprotocol using the purportedly revised file and compare the newlygenerated unique identifier with the previously stored uniqueidentifier. In this way, the analytics server identifies whether thefile has been revised using less processing power than conventionalsystems.

In some configurations, the analytics server may use message-digestprotocol 5 (MD5) hashing protocol to generate the unique identifier foreach identified file. Using the MD5 hashing protocol, the analyticsserver may generate a unique identifier for each file that contains a128-bit hash value representing the content of each identified file.Because the unique identifier comprises an output string, which issmaller than the original file, executing data-comparing protocols onthe unique identifier (rather than performing the same protocols on theoriginal file) requires much less processing power.

Each element related to a file may include a unique identifier. Forinstance, where a file is stored (storage drive or a storage folder) mayhave its unique identifier. When a file is sent via an email message,the email may have its own unique identifier. In some configurations,the analytics server may use these identifiers to compare files anddetermine whether they are related. For example, a PDF stored on ashared data repository (e.g., Google Drive) may have 3 uniqueidentifiers: (1) a unique URL, (2) a unique ID generated by the shareddata repository, and (3) a unique content-based hash ID generated by theanalytics server. The analytics server may use all the uniqueidentifiers to compare files.

In a non-limiting example, file 1 may be associated with a uniqueidentifier of a related task, a unique identifier of a related email,and a unique identifier of a storage drive. File 2 may be associatedwith a unique identifier of a related email and a unique identifier of arelated task. In some configurations, the analytics server may comparethe unique identifiers of files 1 and 2 (e.g., unique identifier of theemail related to file 1 compared to unique identifier related to file2). When a number of similar unique identifiers satisfy a threshold, theanalytics server may determine that the files are related.

As a result, when two web-based files (e.g., two files saved on athird-party file sharing application) share the same drive identifier(e.g., are saved within the same storage drive or folder), the files maybe associated with similar unique identifiers associated with the folderor the storage drive. Consequently, the analytics server may determinethat they are related.

In some configurations, the analytics server may generate the uniqueidentifier for each file based on a combination of one or more of theabove-described methods. For example, the analytics server may generatethe unique identifier for each file using the hashing protocol, and anidentifier of where the file is saved. As described herein, a “unique”identifier is unique to each file. Furthermore, other elements describedherein can have unique identifiers. For instance, a storage drive onwhich a file is stored may have a unique identifier. In another example,an email sent to a user may have its own unique identifier. Forinstance, the analytics server generates unique identifiers such that aunique identifier cannot be generated for two unrelated files.

When the analytics server determines that the file is a file path,shortcut, link/URL, bookmark, or an identifier of an online file (e.g.,Google Docs®, OnShape®, and the like), the analytics server ay generatethe unique identifier based on the file's URL. For instance, a uniqueidentifier may be embedded within the URL of a file.

At step 230, the analytics server may generate a computer modelcomprising a set of nodes where each node comprises metadata indicatingcontext data of each file within the plurality of files, wherein when afirst unique identifier of a first file matches a second uniqueidentifier of a second file, the server links a first node correspondingto the first file to a second node corresponding to the second file. Theanalytics server may create a computer model comprising a nodal datastructure (or data graph) where each node represents an identified file.The analytics server may store the nodal data structure in the systemdatabase (or any other electronic data repository, such as a cloud basesstorage, local/internal data storage, distributed storage, blockchain,and the like) described in FIG. 1.

The nodal data structure may be a complete map of all the filesidentified in step 210. Each node may also contain metadata furthercomprising historical (e.g., context) data associated with the file,such as the generated unique identifier of the file, title, mime type,file permissions, comments, and the like. The metadata may also indicatea revision history associated with each file. For instance, the metadatamay include timestamp of every revision for each file, a uniqueidentifier (e.g., user ID, IP address, MAC address and the like) of theuser and/or the computing device who accessed and/or revised the file,and the like. Other context data may include, but not limited to, emailidentifiers (e.g., unique email identifiers, sender identifier, receiveridentifier, and the like), tasks associated with the files, useridentifiers, mime type, collaboration information, viewing permission,title of each file, and the like.

The metadata may also include context information associated with eachfile. For instance, the metadata may include email/chat communicationthat are related to each file. In another example, if the analyticsserver determines that a file has been transmitted via an email or otherelectronic communication protocols (e.g., referenced or attached in anemail message, referenced in a chat session, and the like), theanalytics server may include a transcript of the electroniccommunication (e.g., body of the email) in the node, as metadata. Theanalytics server may index each node based on its associated metadataand make each node searchable based on its metadata.

The analytics server may compare the unique identifiers for of all thefiles identified in step 210. When the unique identifiers of two or morefiles match, the analytics server may link the nodes representing thetwo or more files in the above-described nodal data structure. A link(or edge) may connect similar or associated nodes within a nodal datastructure such that the analytics server may retrieve context metadatamore efficiently. Edges can be directed, meaning they point from onenode to the next, or undirected, in which case they are bidirectional.The analytics server may use different directed or undirected edges tolink different nodes. Edges between nodes can be given specialclassifications, including but not limited to “copy,” “version,”“parent,” “child,” “derivative,” “shared email,” “shared task,” “sharedtag,” and “shared folder.” The analytics server may also combinerelevant metadata from related files and display to the client (e.g.,files A and B are copies of each other, and file B is attached in anemail message. When user previews file A, the email message for file Bcan be displayed). As described herein, the analytics server may use thelinks to identify a latest version of a related family of files.

Referring now to FIG. 3, nodal data structure 300 represent a nodalstructure created based a set of identified files and related nodesconnected via different edges. As depicted in FIG. 3, the analyticsserver identifies 17 files and creates a node for each file (nodes 310a-i and nodes 320). For instance, node 310 b represent a pdf file storedlocally on a computer of an entity (e.g., computer within a network ofcomputers); node 310 h may be a PowerPoint Open XML, file stored on acloud storage accessible to another computer within the same network. Asdescribed above, each node may include an indication of a location wherethe file is stored. For instance, node 310 e may represent a DOCX filestored in a local database. Therefore, node 310 e may include metadatacomprising a path of the DOCX file to the local database. Additionally,as described above, multiple nodes may be linked together. For instance,links 330 a-h connect nodes 310 a-i that represent related files.Furthermore, because the analytics server identifies that nodes 320 arenot related, the analytics server does not link nodes 320, as depictedin FIG. 3. As described above, a “file” may also refer to a pathassociated with data. For instance, a file may refer to the underlyingdata regardless of where the data is stored and/or hosted or theapplication needed to view the data. For instance, a file may include alink (directing a user to view the underlying file). The file may onlyexist as on online file and may only be accessible through an internetbrowser or mobile application, and in some cases, it may not be able tobe downloaded to a local machine without some type of conversion (e.g.,Google Docs® or Google Slides® only exist online, but can be downloadedas DOCX or PPTX).

A path may specify a unique location of a file within a file system oran electronic data repository. In some configurations, a path may pointto a file system location by following the directory tree hierarchyexpressed in a string of characters in which each component of thestring, separated by a delimiting character, represents a directory. Insome configurations, the analytics server may use a uniform resourcelocator (URL) to identify each file's stored location. For instance,when a file is stored onto a cloud storage or when a file is stored ontoa third-party shared drive, the analytics server may include a URL ofthe file in the nodal data structure.

In some configurations, and as described above, the nodal structure maynot include the identified files and may only comprise nodesrepresenting file locations (and other metadata) and edges representinghow different files are related. For instance, instead of storingmultiple files (and possibly multiple version of the same file and/orrelated files) the analytics server may only store the nodal datastructure in a local or external database. In this way, the analyticsserver may conserve significant storage space because storing arepresentation of a file requires significantly less storage capacitythan storing the file itself. Furthermore, as described herein,identifying relationships (and executing various protocols to identifycontext, relationship or other related data for each file) is much lesscomputationally intensive when performed on the above-described nodaldata structure than executing the same protocols on the filesthemselves. In this way, the analytics server may conserve significantcomputing and processing power needed to provide file managementservices. As a result, the analytics may deliver results in a faster andmore efficient manner than provided by conventional and existing filemanagement methods.

As depicted, the nodal data structure 300 may include all dataassociated with users' workflow. For instance, the wild the nodesdescribed above represent different files, nodes 340 a-e may representworkflow components generated because of users' work. For instance, thenode 340 a corresponds to organization chart generated based on customerrelationship management (CRM) software solution (internal or third partysolution). The node 340 b may correspond to new employees hired wherethe data is generated based on an applicant tracking system softwaresolution (internal or third party solution).

The node 340 c may correspond to one or more tasks associated with oneor more employees. For instance, an organization may use an internal orthird party software solution to help employees execute various tasksefficiently. The analytics server may identify the tasks I may generatea node for each task. Accordingly, the analytics server may identifythat one or more tasks may be related to one or more files and/or workcomponents within the nodal data structure 300.

The node 340 d may correspond to a contact within a contact list of anemployee/user. The analytics server may scan various software solutions(internal and/or external) and may identify contacts associated witheach user/employee. The analytics server may then generate a node foreach contact accordingly. As described herein, the analytics server maythen identify that a contact is related to another node that mayrepresent a file and/or a workflow component within an organization. Thenode 340 e respond to one or more messages generated and transmittedamong users, such as emails or any other messages (chat applications).

As depicted, the analytics server may not differentiate between filesstored on data repositories accessible to one or more users and workflowcomponents generated/accessible to the users. The analytics server mayexecute various analytical protocols described herein to identifyrelated nodes and may use edges to link or merge the related nodes. Forinstance, the analytics server may use edges 350 a-c to connect relatedwork component nodes. The analytics server may also use age 360 twoconnect node 340 c (workflow component) to the node 310 d, andindirectly connect node 340 d to nodes 310 a, 310 b, 310 e, 310 c, 310h, and 310 i.

Referring now to FIG. 4, another illustration of a nodal data structureis illustrated. Nodal data structure 400 represents a clustered nodaldata structure where the analytics server clusters related files intodata clusters 410 and 420. As described above, each node within the datastructure 400 represents an identified file. Each node within the nodaldata structure 400 may include metadata associated with each respectivefile (e.g., indicating the location, type, historical data, and contextdata associated with the file). Upon identifying relationships betweenfiles, the analytics server may generate a cluster that represents allrelated nodes/files. For instance, the analytics server may determinethat a unique identifier of a first pdf file on a first computer(represented by node 410 a) matches the unique identifier of a secondpdf file stored on a cloud storage accessible by a second computer(represented by node 410 e). In response to identifying thisrelationship, the analytics server may link nodes 410 a and 410 e.Subsequently, the analytics server identifies that a PowerPoint file(represented by 410 b) and a web link (represented by node 410 d) weretransmitted via an email message (represented by node 410 c) that alsoincludes the pdf file represented by node 410 a. In response, theanalytics server may link node 410 a to each of nodes 410 b, 410 c, and410 d. The analytics server may further link node 410 b and node 410 dto node 410 c because files represented by nodes 410 b and 410 d may berelated. As seen in FIG. 4, the analytics server creates data cluster410 that includes all the above-mentioned nodes.

The analytics server may execute similar protocols as described above toidentify interrelated files and generate multiple clusters. For example,the analytics server may cluster nodes 420 a-c into cluster 420.Furthermore, one or more nodes within different clusters may also belinked, as represented by edge 430. As described above, data cluster 410and 420 each represent a related family of files. Different clusters maybe stored into different shards to optimize storage and efficiency whenidentifying nodes (e.g., step 260).

In some configurations, the analytics server may consolidate allmetadata associated with each identified file to identify all relatedusers and content. In a non-limiting example, the analytics server mayidentify that user 1 sent File A in an email (along with File B) to user2; user 2 downloaded File A and stored File A in a folder with File C.As a result, the analytics server may connects nodes representing filesA, B, and C. When a user accesses any of the files A, B, or C, theanalytics server notifies the user regarding the relationship betweenthese files. As described herein, the analytics server may onlycustomize the notifications in accordance with each user's accesspermissions. For instance, if a user is not authorized to access (orview) File B, the analytics server may only display notificationsregarding Files A and C to the user.

Referring back to FIG. 2, at step 240, the analytics server mayperiodically scan the plurality of electronic data repositories, tomonitor the first file and the second file. The analytics server mayperiodically scan the electronic repository as discussed above. In someconfigurations, the frequency of data scanning may be predetermined ormay be adjusted by an administrator in accordance with an entity'sneeds. For instance, an administrator may require the analytics serverto scan the electronic data repositories every week, day, or multipletimes per day depending on their unique needs and data sensitivity.

In some configurations, the analytics server may only scan theelectronic data repositories in response to receiving a notification ora trigger from another server, such as an email message, a third partyAPI or a data management server operationally in communication with adata repository. The analytics server may use application-programminginterfaces and/or web hooks to achieve the above-described results. Forinstance, as described above, the analytics server may utilize variousAPIs to monitor the identified files. Therefore, the analytics servermay receive a notification, from an API, that a file has been revised.In some embodiments, the API may transmit details of the revisions(e.g., user name, timestamp, and the like). In some other embodiments,the API may not be configured or authorized to transmit such detaileddata. In those embodiments, in response to receiving the notificationfrom the API indicating that a file has been revised, the analyticsserver may further scan the electronic repository (or otherrepositories, such as email, third-party applications, and otherrepositories) on which the file is stored. As a result, the analyticsserver may retrieve revision details associated with the revised file.

At step 250, the analytics server may, for each instance of the serverdetecting a related file to the first file, merge the first node wherethe merged first node corresponds to a context data of related files(e.g., storage location and a timestamp of the related file to the firstfile and context data of the first file). In response to identifying arevision or a modification to a file, the analytics server may revisethe nodal data structure accordingly. For instance, as described above,the analytics server may identify that a file has been revised ormodified by a user within the network. The analytics server may thenupdate the metadata associated with the node and the respective edgerepresenting the revised file with revision/modification data. Forinstance, the analytics server may update the node metadata with a useridentifier, timestamp, content of the revision, and other historicaldata. When the analytics server identifies a revision of the file, therevised file is no longer a “copy” of the original file. Therefore, theanalytics server updates the metadata of the revised file from “copy” ofthe original file to a “version” of the original file.

In some configurations, the analytics server identifies related filesbased on their context data stored onto one or more nodes representingeach respective file. For instance, in some embodiments, the analyticsserver may update or revise the nodal data structure by generating newnodes and/or edges. For instance, when the analytics server discoversthat a user has attached a file in an email communication, the analyticsserver may generate a node that represents the email communication. Theanalytics server may then update the node's metadata with informationassociated with the email communication (e.g., timestamp, email body,email address, sender user identification, receiver's useridentification, and other context data described herein).

In some configurations, if the email communication includes other filesor web links, the analytics server may create individual nodes for otherrelated files. For instance, and referring to FIG. 3, node 310 drepresents email communication between two users where one user attacheda pdf file represented by node 310 b. Furthermore, in the emailrepresented by node 310 d, the user also attached a document representedby node 310 e. As depicted in nodal data structure 300, the analyticsserver may also link the above-described nodes using edges 330 b and 330e. As a result, the analytics server may continuously and iterativelyupdate the nodal data structure. Therefore, the nodal data structure isa dynamic computer model, which adapts to user interactions.

In some configurations, the analytics server may combine metadata frommultiple related nodes into a single metadata file. Instead of each nodehaving a separate metadata file, the analytic server may create a singlemetadata file associated with a file where the metadata file containsall metadata associated with all (or a given subset of) related nodes.For instance, if File A is related to Files B-F, the analytics servermay create a single metadata file and combine metadata associated withFiles A-F. Upon identifying additional related files (or other relateddata, such as tasks, messages, and the like), the analytics server mayupdate the metadata file accordingly.

In some configurations, the analytics server may augment the metadatafile using public data. For instance, in addition to scanning theelectronic repositories described herein, the analytics server may alsoscan publicly accessible repositories (e.g., public websites or otherpublicly accessible data). When the analytics server identifies a publicfile related to an identified file, the analytics server may augment theidentified file's metadata file. For instance, the analytics server mayidentify a video file stored locally onto a user's computer. Theanalytics data may then determine that the identified video is similarto a video publicly shared on a website (e.g., YouTube®). Consequently,the analytics server may augment the identified video's metadata fileusing data associated with the publicly share video (e.g., URL of thevideo).

As described above, the analytics server may use two methods to mergetwo nodes where the two nodes represent two related files (e.g., copiesof the same file, and/or files that have been determined to be related).First, the analytics server may create a new node for the newlydiscovered related file and may link the nodes together. Second, theanalytics server may combine the metadata of the newly discovered filewith the original file (e.g., create a single metadata file and combineall metadata corresponding to context information of the related file tothe original file). The analytics server may also use one or both of theabove-described methods when merging two nodes.

In a non-limiting example, the analytics server may identify two copiesof the same file where the first file is stored on a local database andthe analytics server identifies the second file as an attachment in anemail sent from a first user (or when the file path is transmittedthrough the email) to a second user. The analytics server may thencombine the metadata associated with the email (e.g., email message,sender identifier, receiver identifier, content data, mailboxidentifier, and the like) with the metadata associated with the firstfile (e.g., name of the local system, size, data modified, folder, andthe like). For instance, the analytics server may generate a singlemetadata file that contains metadata associated with the first file andthe second file. The analytics server may then use the combined metadatafile to identify related files, build, and suggest relationships betweenthe first/second file and other files identified. In another example,the analytics server may generate a node that represents a file. Whenthe file is attached in a task, the analytics server may generate a newnode for the task. Therefore, the analytics server may operate in twoways: 1) creating a new node for a related file; and/or combining thecontext into a single metadata file for a file. In some configurations,the analytics server may use a combination of the above-describedmethods.

At step 260, the analytics server, in response to receiving from anelectronic client device, a request to access the first file or thesecond file, may identify all or some of the related files to the firstfile or the second file in accordance with a latest timestamp of thefirst node or the second node. The analytics server may receive arequest to access a file. For instance, a user may click or otherwiseinteract with a file and transmit a request to access the file (e.g.,view, edit the content, revise the name, send, or otherwise interactwith the file). In some embodiments, the user may access a sharedthird-party application and transmit a request to access the file. Inresponse to determining that the user has requested to access a file,the analytics server may identify a node within the nodal data structurethat represents the requested file.

Upon retrieving the identified node, the analytics server may retrieveall related nodes and metadata associated with the identified nodesand/or the related nodes within the nodal data structure. The analyticsserver may analyze the metadata retrieved and identify all related files(including a latest version of the requested file). For instance, theanalytics server may retrieve all timestamps for all nodes related to anode representing the requested file. The analytics server may thencompare all timestamps to identify a latest version of the requestedfile. The analytics server may also identify relationships between filesby determining relationships between different nodes representing thosefiles. These relationships (identified related nodes) may be displayedon the GUI viewed by the user. For instance, when a user accesses afile, the analytics server may identify the original file, differentcopies, versions, derivative, shared tasks, shared comments, sharedemails, shared tags, and shared folders that are associated with thefile. The analytics server may also display these related items on theGUI, as depicted in FIG. 5, FIGS. 6A-C, and FIGS. 7A-B.

At step 270, the analytics server may retrieve all files related to therequested file-in accordance with the storage location of the latestversion of the requested file identified based on the updated firstnode- and may transmit the retrieved latest version to the electronicclient device. As described above, the analytics server may identify anode that represents a latest version of the requested file.Subsequently, the analytics server may use the path stored within themetadata of the identified node to retrieve a latest version of therequested file and transmit the retrieved latest version to theelectronic client device. The analytics server may populate a graphicaluser interface that displays various information associated with therequested file. As described herein, the analytics server may directlydisplay graphical user interface on the user's computer or mayincorporate the graphical user interface into a third-party application,such as a third-party email application. In some embodiments, theanalytics server may display all versions of the requested file. Forinstance, in addition to displaying the latest version, the analyticsserver may display an option for the user to access an older version ora related version (e.g., a version of the requested file that was sharedin an email). Using the above-described options, the user may access andinteract with an older version of the request file.

Using the methods and systems described herein, the analytics serveralso allows users to interact with files stored onto disparateelectronic data repositories using the same web-based application (e.g.,browser). For instance, the analytics server may identify a noderepresenting the requested file stored onto a first electronic datarepository, a first related node representing an email stored onto asecond electronic data repository that included the requested file as anattachment, and a related file stored onto a third electronic datarepository. The user may access all the above-mentioned files (requestedfile, related file, and related email) from the same browser even thoughthe above-mentioned files are stored onto different electronicrepositories.

To retrieve all related data (e.g., all related files including thelatest version of a file), the analytics server may utilize the ApacheLucene project's open source enterprise search platform, Solr®, forfull-text indexing and searching. As described above, the analyticsserver may index every node within the nodal data structure, whichallows the nodes to be searchable by their associated metadata.Furthermore, executing the above-described indexing and searchingprotocol on the nodal data structure, as opposed to all files stored ina central data repository, allows the analytics server to identify nodesand retrieve related metadata in real-time or near real-time.

Referring now to FIG. 5, an example of a graphical user interfacedisplaying file context information is illustrated. In someconfigurations, the analytics server may display GUI 500 directly on auser's computer. For instance, when a user interacts with a file (e.g.,clicks on a file and request the file to be opened), the analyticsserver may display the GUI 500 on the user's computer. In the depictedembodiment, a user requests File XYZ to be opened. As a result, theanalytics server displays the GUI 500. In some other embodiments, theanalytics server displays an indicator associated with File XYZ anddisplays the GUI 500 in response to the user interacting with theindicator.

The GUI 500 may display filename and file types of the requested file inthe graphical component 510. For instance, graphical component 510indicates File XYZ (file name) and further indicates that File XYZ is aPDF file (file type). The GUI 500 may also comprise an interactivegraphical component 520. When the user interacts with the interactivegraphical component 520 (e.g., by clicking), the analytics server maydisplay content of File XYZ. Another interactive graphical component 560may allow the user to share File XYZ with other users. For instance,when the user interacts with the interactive graphical component 560,the analytics server may generate an interactive link configured todirect the recipient to the File XYZ (e.g., a URL or other pathsindicating a storage location of the File XYZ).

In some configurations, the analytics server uses a messagingapplication to transmit the requested file to other users. When a userinteracts with the graphical component 560, the analytics server maydirect the user to the messaging application (or otherwise referred toas the sharing panel). Referring now to FIG. 7A and FIG. 7B, an internalmessaging application is illustrated, in accordance with an embodiment.The internal messaging application may include a graphical component700, where the user can search for other users/employees/contacts,identify one or more recipients, and share the file with the identifiedrecipients. When the user interacts with the graphical component 710(“add message), the analytics server may display a graphical component720 where the user can add customized messages and other attachments.

In some embodiments, an organization may use an internal (or thirdparty) software solution to aggregate and consolidate all tasks,messages, and contacts across all connected tools (e.g., softwareapplications used by users/employees). The analytics server may use thatsoftware solutions to transmit he requested file to the users.Additionally or alternatively, the analytics server may integrate itsservices into other software application. For instance, the methods andsystems described herein can be implemented into other applications,such that users no longer need to use multiple applications.

The internal messaging application may also provide the user with theoption to transmit the file via any third-party email or messagingsystem previously connected to the analytics server. As a result, theuser may draft an email using the internal messaging applicationprovided by the analytics server (e.g., while interacting with the file)and the analytics server may transmit the email by communicating withthe third-party email application. The above-described method creates asignificant positive user experience because the user is no longerrequired to interact with multiple interfaces and applications.

As described above, the analytics server/messaging application may alsoprovide users with the option to transmit other workflow components. Forinstance, users may be able to share tasks, contacts, messages, and thelike. For any unit of work, a user can easily share and embed the unitof work within a message. For example, conventionally, users may shareemails with other users by forwarding the emails. However, usingmethods, systems, and software solutions described herein, a sender canshare an email with a recipient else not included in the email message,without forwarding the email.

In some configurations, the analytics server may also displayinterrelated relationships associated with the requested file. Forinstance, when the user interacts with the “advanced” hyperlinkdisplayed in the graphical component 700, the analytics server maydisplay the graphical component 730 where the analytics server displaysfiles and activities related to the requested file. For instance,graphical component 740 displays information regarding the first copy ofFile XYZ (e.g., data regarding storage area and the date of the lastmodification), graphical user component 750 displays data regarding the“creator” of the first copy of File XYZ, graphical component 760displays data regarding a group of users who are permitted to accessFile XYZ, graphical component 770 displays data associated with a secondcopy of File XYZ (including graphical component 780 displaying thecreator).

Using the graphical component 730, the user can also manage all of thepermissions for all of the different versions and files related to FileXYZ simultaneously. For instance, the user can revise a list of userswho can access File XYZ by interacting with the drop down menu displayedin the graphical component 760.

As described above, the analytics server may customize each GUI andgraphical component described herein in accordance with the user's role.For instance, while the analytics server may identify that the File XYZhas 10 related copies/versions, the user may only view thecopies/versions and relevant metadata to the two copies/versions forwhich the user is authorized to view.

The GUI 500 may also comprise interactive component 530 comprising astatus indicators illustrating a status of File XYZ. For instance, theinteractive status indicators may indicate whether File XYZ is muted or“watchlisted.” A file can be muted, followed, or watchlisted. If a userhas never interacted with a file, the user may never get any updates ornotifications regarding that file. If a user has interacted with a file(e.g., modify a file, send a file, received a file, create a file,change permissions of a file), the user may get key updates regardingthat activity around that file. In some embodiments, theupdates/notifications may be transmitted to each user via a feed, suchas an RSS feed delivered to each user, or via daily email overviews ofall relevant updates. Users can also choose to manually follow orwatchlist a file. When a user watchlists a file, the user may receivenotifications, regarding all activity related to that file, includingall of its copies and versions.

Through advanced settings, the user may determine to “hide” or “ignore”a file, meaning that the analytics server may not compare the file toother files stored within other electronic data repositories.Additionally or alternatively, the analytics server may not monitor ahidden or ignored file. However, when a file is watchlisted, theanalytics server periodically monitors the file, as described above. TheGUI 500 provides the user with the option to change the status of FileXYZ by interacting with the interactive indicators.

The GUI 500 may also comprise a graphical component 540, which displayscontext information associated with File XYZ. Data displayed in thegraphical component 540 may be retrieved from metadata stored within thenodal data structure and may include type, size, timestamp of creation,owner, an indicator of storage location, and different tags associatedwith file XYZ. Graphical component 540 may provide the user with theoption to delete, add, and/or revise different tags. In someimplementations, users can add custom metadata fields. In someconfigurations, the analytics server may further identify suggested tagsfor file XYZ based on related files and their respective tags. Theanalytics server may also suggest tags based on file content, filerelationship, a user identifier, tags gleaned through user activity, andmore.

The GUI 500 may further comprise a graphical component 550, whichdisplays all the versions and copies (including the latest version) ofthe requested file permitted to be viewed by the user. For instance, thegraphical component 550 a displays that user 1 uploaded File XYZ one onJun. 26, 2018. The graphical component 550 b displays that the analyticsserver has identified a related file (e.g., a file with a matchingunique identifier, as described above). As depicted in graphicalcomponent 550 b, the related file name is File XYX, which is also a PDFfile. The graphical component 550 b displays that File XYX wasidentified in an email communication between user 1 and user 2 on Apr.12, 2018. Furthermore, graphical components 550 c-e indicate where FileXYZ and File XYX are stored respectively. Even though the embodimentdescribed here uses a related file that is the same file type as therequested file, in some configurations, a related file may not be thesame file type. For instance, a PDF file stored in a first electronicrepository may be stored in a second electronic repository as a documentfile.

As depicted in 550 d, sometimes a file (or other workflow data) can bestored in a third-party data repository where the analytics serveridentifies the file by integrating with the third party. For instance,the file indicated in the graphical component 550 d may be associatedwith third-party CRM and/or ATS software. The analytics server mayprovide additional metadata by integrating with these third-partysoftware solutions, which can help contextualize files/workflow. Forinstance, if a file corresponds to a resume, the file may be referencedin an ATS software solution, which may be internal/native to anorganization or may be a third party ATS solution. If a user interactswith the resume and his/her email, the analytics server may displaycontext data associated with the resume, such as whether the resume isassociated with a new hire in an ATS software (e.g., whether the resumeis being considered for a first round or a second round of interviews),whether the analytics server has identified any additional nodes orfiles associated with the resume, and the like. In some embodiments, theanalytics server may direct the user to the ATS software solution wherethe user can obtain more information and review related files/data.

In some configurations, each graphical component shown may beinteractive. For instance, the user may click on any of theabove-described graphical components and the analytics server may directthe user to the file represented by each respective graphical component.For example, if the user clicks on the graphical component 550 c, theanalytics server may retrieve a location of the email communicationbetween user 2 and user 1 and may change component 540 to displaydetails regarding the graphical component 550 c. The analytics servermay also open the message tab of GUI 500 and display the emailcommunication on the GUI 500, as shown in FIG. 6B.

The graphical component 550 may also display a suggested related file(e.g., a version). As described above, the analytics server mayperiodically monitor all files accessible by all computers within anetwork (e.g., all computers within an entity or all registered users)to identify related files/content. When the analytics server identifiesthe related file/content, the analytics server may display theidentified related file/content in the GUI 500. For instance, graphicalcomponent 500 f indicates that the analytics server has identified FileXYP, which is also a PDF file and is identified as related to the fileXYZ by the analytics server. The graphical component 500 f alsoindicates that file XYP was received in an email correspondence fromuser 3 user on Oct. 1, 2018. This is possible because the analyticsserver determines that file XYP may, for example, share an email threadwith file XYZ and due to other similar metadata, it establishes thatthey are likely versions of each other. Alternatively, user 3 could havebeen working on file XYZ before (e.g., satisfying a predetermined timethreshold) saving file XYP and therefore, by closeness of activity alongwith other considerations, the analytics server suggests file XYZ andXYP as versions of each other.

Referring now to FIG. 6A, another example of a graphical user interfacedisplaying file context information is illustrated. As described above,the analytics server may incorporate a graphical user interfacedisplaying file previews and context information into a third-partyapplication, such as a third-party email application, a third-party filesharing application, or a third-party project management tool.Additionally or alternatively, the analytics server may generate aweb-based application (e.g., a website) and/or native desktop and mobileapplication where registered users can login to access and/or managedifferent files. The web-based application may incorporate otherthird-party applications, such as email applications.

GUI 600 represents a graphical user interface generated and operativelycontrolled by the analytics server. In the depicted embodiment, theanalytics server incorporates data from third-party email, messaging,and other collaborative applications into the GUI 600. For instance, GUI600 corresponds to a user who has previously registered and connectedhis or her third-party email, Slack® account, and Asana® account. As aresult, the analytics server updates the graphical component 610 in realtime by continuously querying the third-party email, messaging, projectmanagement, and cloud storage applications and populating the graphicalcomponent 610. As described above, the analytics server may utilizevarious web hooks and/or APIs instead of continuously scanning datarepositories.

The GUI 600 depicts an example where a registered user has requested toaccess File XYZ by searching for the File XYZ using the search bar 690.As described above, the analytics server may perform the search using asearch platform (e.g., Solr®), which maintains an index of all files,messages, tasks, and more across all of the data repositories includingall third-party data sources. In some embodiments, the analytics servermay augment the search results through an understanding of theinterconnected nature of the nodes (e.g., how interconnected are thenodes). For example, the analytics server may execute a predeterminedranking algorithm and display the search results accordingly. Uponidentifying File XYZ (by identifying a node that represents File XYZwithin a nodal data structure), the analytics server populates thegraphical component 620 that displays a “quick view” of the file XYZ.

The GUI 600 may also comprise a graphical component 640 that includesinteractive hyperlinks/components where a user can open File XYZ orchange the file's user permission and share a link directing recipientsto access File XYZ. In this illustrative user interface, the graphicalcomponent 640 includes hyperlinks for info, messages, tasks, andtimeline. Activating the info hyperlink can display information such astimestamp, author, version number, or the like. Activating the messageshyperlink, as shown in FIG. 6A, can display messages containing FileXYZ. Activating the tasks hyperlink can display tasks that contain orare related to File XYZ. Activating the timeline hyperlink can display alisting of chronological events and related projects associated withFile XYZ.

Activating the deals hyperlink can display a list of all the deals afile is related to in CRM or other sales software (e.g., third partysoftware), as well as other related data (e.g., what stage it is in, thepeople involved in those deals). Similarly, other hyperlinks could beadded, such as project that correspond to the different data sourcesthat might be connected to the analytics server. The GUI 600, includinggraphical component 640, thereby allows a user to consolidate and manageall updates, notifications, contacts, messages, tasks, and content on asingle user interface in a more user-friendly manner.

The GUI 600 further includes a graphical component 630, which indicatessuggested related files. In some configurations, the analytics servermay display the suggested and/or related files, as shown in graphicalcomponent 630. The suggested (e.g., using soft factors) and/or related(e.g., using hard factors) files may be grouped and/or sorted based ondifferent customizable categories. For instance, other files that areattached in the same email thread as File XYZ and other files that are apart of the same deal as File XYZ in a CRM could be grouped together.Furthermore, related files may be sorted based on file type. Thegraphical component 630 may also include “quick view” access todifferent versions of File XYZ. For instance, graphical component 630comprises version 631 from Oct. 26, 2018, version 632 from Oct. 28,2018, and version 633 from Oct. 1, 2018.

The GUI 600 further includes a message history of File XYZ, asillustrated by the graphical component 650-680. As described above, eachgraphical component within the GUI 600 may comprise an interactivecomponent configured to direct the user to a file, a message, a task, ora person related to File XYZ. The timeline tab includes a history of alluser activity events related to File XYZ and all of its versions,including but not limited to email messages, edits, downloads, andviews.

For instance, the graphical component 660 indicates that File XYZ isassociated with 12 comments on a third-party project managementapplication. When the user clicks or otherwise interacts with thegraphical component 660, the analytics server displays each comment onthe GUI 600. The user can easily respond to comments from any thirdparty system (e.g., a task management application) in which the file wasuploaded to, or used in a chat conversation. In another example, whenthe user interacts with the graphical component 650, the analyticsserver displays the email communication between the user and John Smith.For instance, when a user interacts with the graphical component 650,the analytics server may display a new graphical user interface 601. Theanalytics server may use a unique email identifier to retrieve the emailvia IMAP and/or API calls and to display the related email message orthread in full, as shown in graphical component 602. Using the featuresdisplayed in graphical component 602, the user may be able to respond inline, and click and drag other versions and/or related files into themessage to attach them. In another example, when the user interacts with“task” option in the graphical component 640, the analytics server maydisplay the GUI 604 where the graphical component 604 displays differenttasks related to the requested file (e.g., tasks that have a file orfolder as an attachment).

In a non-limiting example, the analytics server scans data accessible toall computers within a company where the data is stored onto multipleelectronic repositories. Upon identifying all files stored ontodifferent electronic repositories, the analytics server executes ahashing protocol for each identified file to generate a uniqueidentifier for each file. For references to files or websites (e.g.,links shared in email messages, website bookmarks, file shortcuts,etc.), the analytics server identifies different unique identifiers suchas file path, a URL, or a third-party party system's unique ID. Theanalytics server further creates a nodal data structure where each noderepresents an identified file and contains metadata indicating dataassociated with each file. The analytics server further compares theunique identifiers for all the identified files. When two files havematching unique identifiers, the analytics server updates the nodal datastructure by linking the nodes representing the two files as copies ofone another.

The analytics server may further scan other data sources with variedmetadata, such as messaging systems, task management tools, and thelike. The analytics server may also update the nodal structure to relate(e.g., link/merge) multiple files that share emails, messages, tasks,and activity events for example. Similarly, the analytics server cangenerate unique identifiers for other types of data/units of work,including but not limited to messages, tasks, contacts, calendar events,and notifications in order to link/de-duplicate units of work that arethe same across different systems. For example, a URL that points to atask within a project management tool can be found within an emailmessage and that same task could be otherwise indexed by the analyticsserver directly from the project management tool (e.g., via the projectmanagement tool's API), such that the analytics server links the uniqueURL for that task with the indexed task in the database.

The analytics server periodically scans (or receives updates via API orweb hooks) the electronic repositories and iteratively updates/revisesthe nodal data structure based on comparing unique identifiers of allthe units of work and linking appropriate nodes. When a user operating acomputer within the company requests to access a unit of work, theanalytics server retrieves the nodal data structure and identifies anode representing the requested unit of work. The analytics server alsoidentifies related nodes and context data stored on the node's metadata(representing the requested unit of work). The analytics server thendisplays detailed information associated with the requested unit ofwork. For instance, the analytics server displays all the versions(including a latest version) of a requested file, related files, relatedtasks, related people, email communication related to the requestedfile, and the like.

In some embodiments, the electronic platform provided by the analyticsserver may allow users to access any workflow component using the nativeapplications. For instance, as depicted in FIG. 30, the analytics servermay display the graphical element 3002 where the user can interact witha workflow component's name (e.g., file name) and the analytics servermay direct the user to the third-party application that hosts thatparticular workflow component. Using the depicted method, users do notneed to activate the third party application to accesses files or otherworkflow components.

In addition to identifying related files and workflow components usingexplicit relationships (hard factors) described in FIG. 2, the analyticsserver may also use a variety of methods to identify related files andworkflow components using implicit relationships (soft factors). FIGS.8-12 describe a variety of methods utilized by the analytics server toidentify related content (e.g., files and workflow components) usingdata associated with each file. The analytics server may monitor contextdata associated with files and workflow components. For instance, theanalytics server may monitor how various files were shared among usersand/or edited/accessed by each user. The monitored context data issometimes referred to herein as the soft factors or implicit data. Howusers interact with files and workflow components may indicate whetherthey are related to each other or to a particular project.

As will be described herein, the analytics server may use artificialintelligence (AI) and machine learning (ML) modeling techniques toidentify related content. For instance, the analytics server maycalculate a likelihood of relatedness for different files based onimplicit data (e.g., user interactions). If the analytics serveridentifies, within a reasonable degree of certainty, that a nodecorresponding to a file or workflow component is related another node,the analytics server may link the related nodes.

As illustrated in FIG. 8, the analytics server may use a three-stepprocess to identify related content and to establish relationshipsbetween related nodes within the nodal data structure. At step 810 (asillustrated in FIG. 9), the analytics server establishes explicit (e.g.,known data or hard factor) relationships between nodes. At step 820,(illustrated in FIG. 10) the analytics server establishes relationshipsbetween nodes based on implicit data (e.g., soft factors) by calculatinga relative distance/measure between nodes. At step 830, (illustrated inFIG. 11, the analytics server reclassifies types of relationshipsbetween nodes that are close to one another according to user needs/usecases. The reclassification can put more abstract labels that mightbetter align with user needs/use cases.

Upon identifying related nodes, the analytics server may append theidentified relationships as metadata to each node. For instance, theanalytics server may identify related information (e.g., electronicmessages, files, other users, tasks, calendar events, and notes) foreach node. The analytics server may then generate metadata for each nodethat corresponds to the identified relationships. In someconfigurations, the analytics server may also de-duplicate nodes bycomparing unique identifiers, such as email addresses for contacts togenerate a more efficient graph (nodal data structure).

Although examples and embodiments described herein relate to files andworkflow components, the methods and systems described herein can beconfigured, such that the analytics server identifies related nodes thatcorrespond to a person, user, contact, message, task, etc.

At step 810, the analytics server may use a variety of methods toestablish a connection between nodes based on explicit and known contextdata associated with different nodes. For instance, the analytics servermay use method 200 described in FIG. 2 to retrieve relevant dataassociated with each node (corresponding to different files and/orworkflow components) and may establish proper connection within thenodal data structure accordingly.

At step 810 (e.g., explicit relationships or hard factors), theanalytics server connects different nodes corresponding to related filesthrough explicit relationships. The most basic relationship considersthe MD5 hash fingerprints of each file to identify all copies of onefile across different systems (e.g., data repository and/or softwareapplications). If the hashes of the two files match, the analyticsserver assumes that they correspond to the same file. Other types ofrelationships are built between files that are attached in the sameemail or email thread, files that are attached in the same task, filesthat are referenced in another's comments, and so forth. Activity andrelationships from different copies of the same file are consolidated toenable more relationships to be created.

Referring now to FIG. 9 a visual representation of connected nodes isillustrated. FIG. 9 illustrates a set of nodes where the analyticsserver identifies as related based on known factors, such as beingattached in a monitored electronic communication or having a matchingunique identifier. As depicted, the analytics server may establish knownlinks between nodes in the graph 900. Each node may represent a file,user, or a workflow component. For instance, the node 910A represents auser, nodes 910B/C/F represent different files, node 910D represents atask, and node 910E represents an email message.

Using methods and systems described herein, such as method 200, theanalytics server retrieves context data associated with each node. Theanalytics server uses MD5 hashing methods to identify that node 910F isrelated to node 910C. The analytics server then connects nodes 910D to910F because a copy of the file corresponding to the node 910F has beenattached to the task corresponding to the node 910D. The analyticsserver also connects nodes 910E and 910B to the node 910F because anemail attached the same file as 910F. Finally, the analytics serverconnects the node 910A to the node 910F because a person correspondingto the node 910A edited the file corresponding to the node 910F.

Referring back to FIG. 8, at step 820, the analytics server may connectvarious nodes using implicit relationships among files, users, andworkflow components. This process is also referred to herein as implicitrelationships or soft factors. In this step, the analytics servergathers and stores all user activity across systems and services. Themethods and systems described herein leverage this monitored andcollected user activity data to deduce implicit feedback, which is thenused to expose unknown relationships between files. As a result, theanalytics server does not have to rely on known relationships (hardfactors) or user input to link different nodes. Implicit feedback canalso augment explicit feedback/relationships. Moreover, the analyticsserver is also able to find relationships between nodes that are unknownto users.

As will be described herein, the analytics server may use variousscoring algorithms to identify a likelihood that a pair of nodes arerelated. If the likelihood of relatedness satisfies a threshold (e.g.,two nodes are highly likely related, such as 80% possibility that thenodes are related), the analytics server may revise the nodal datastructure by linking and/or merging the corresponding nodes. If thelikelihood of relatedness does not satisfy the threshold (e.g., there isa 50% possibility that the nodes are related), the analytics server maysuggest the nodes as related or may not link the nodes.

As described above, the analytics server periodically monitors files anduser activity within a network. The analytics server periodicallycollects user data (e.g., edit histories, communication histories, filesrevisions, time of revision, electronic communications, location/devicemetadata, current open window (e.g., analytics server windows/smartwindows) and generates a nodal network that represents relationshipsbetween files, messages, tasks, people, web sites, and other types ofinformation.

The analytics server may then consolidate timelines of activity aroundeach user. Consolidating timelines may be technically challenging. Forinstance, a third party data repository application (e.g., Google Drive)may not currently give third party applications (e.g., analytics server)information related to what a given event represents. Instead, the thirdparty data repository application indicates to third party applicationsthat there was a change to some file, without any information about whatthat change was or who performed the action. Therefore, in order todiscern the details around who performed what action, the analyticsserver needs to execute comparisons protocols and data correlations.

In a non-limiting example, the analytics server has previously indexedFile A, and in that record, File A is shown as a private file withoutany shared permissions. Therefore, if the analytics server identifiesthat a “change” was made to File A as described above, it is able tocompare the previous version of File A with the new version of File Aand determine that the permissions were changed and that the originalowner shared the File A with a new “person X.” Furthermore, theanalytics server may compare other fields in the index, such as a hashof the file's contents to determine if the file was revised.

In some configurations, the analytics server may not be able to discernthe actor or the action simply by comparing the old index record withthe new information from the third party data repository application.For example, if the “file A” was previously shared with multiple people,it may not be possible to know which of those people truly shared thefile with person X. The analytics server may coordinate partialinformation from different systems, for example, by analyzing emailmessages to identify if there was an email notification indicating“person Y shared file A with you (person X)” or by tracking activity onlocal systems and identifying that a given user edited a file. Theanalytics server may augment the database/index of consolidated activityby tracking and/or scraping information from user's webviews/screens/computers/applications as they work normally within thosesystems.

In order to identify whether two nodes are related, the analytics servermay first generate an initial label that includes data indicating thattwo files may be related (also referred to herein as implicit dataset orimplicit feedback dataset). The analytics server may then use a filecorrelation algorithm and/or scoring algorithm to generate a score thatrepresents a distance between two nodes. In order to achieve this, theanalytics server may execute various scoring algorithms and or AI/MLmodels.

As described herein, the analytics server may execute various analyticalprotocols to identify whether two nodes are related. The analyticsserver may only link two nodes when the analytics server has determineda likelihood of the two nodes being related that satisfies a threshold.For instance, the analytics server may execute various analyticalprotocols described herein to identify whether two nodes are related. Ifthe likelihood of two nodes being related satisfies a predeterminedthreshold (e.g., the analytics server can confidently determine that thetwo nodes relate to each other), the analytics server may link thecorresponding notes.

The predetermined threshold may correspond to a confidence scoregenerated by the analytics server that identifies a likelihood of thetwo nodes being related. The system administrator or an end user canrevise the predetermined threshold. When the predetermined threshold isincreased, the analytics server may only link nodes when the confidencescore is higher. For instance, a predetermined threshold of 80% mayrequire the analytics server to only link nodes if the analytics serveridentifies (with an 80% confidence) that the two nodes are related.

For possible related nodes that do not satisfy the predeterminedthreshold, the analytics server may link the nodes, such that the nodesare “suggested” as related. As will be described herein, the analyticsserver may suggest related files/workflow components to the end user. Iftwo files are merged, the analytics server has determined that the twofiles are related. However, if the analytics server determines that twofiles are likely to be related, instead of merging/linking theircorresponding nodes, the analytics server may link the nodes. Whendisplaying results, the analytics server may then suggest the nodes asrelated, whereby the end user is informed that the two files may be (butnot guaranteed to be) related.

For instance, as illustrated in FIGS. 26-27, the analytics server maydisplay a software platform with a user's existing software applicationsand accounts (e.g., email accounts or a homegrown file managementsoftware). The displayed interface may show related files grouped bycontext or project (FIGS. 25-26). The platform provided by the analyticsserver also allows users to consolidate all relevant files such thatusers can easily work across accounts. For instance, even if differentfiles are stored within different platforms, a user may access them (ifthey are designated as related) using the same platform (FIG. 27).

The information that the analytics server indexes, correlates, andcontextualizes can also be queried and accessed via API by any number ofthird party software products and tools. In this way, the dataaugmentation can be used to enrich workflows within, for example, auser's favorite email client or task manager.

Initial Labeling

The analytics server may generate a set of predeterminedfeatures/indicators that indicate whether to nodes may be related. Thisinitial labeling may create a weaker/noisier signal that the analyticsserver may improve using ML and reinforced learning. The type of nodes(files, messages, users, tasks, notes, etc.) may have some impact on howthe analytics server generates the initial labeling. The analyticsserver may generate an implicit feedback dataset. The analytics servermay execute various analytical protocols (e.g., scoring and AWL models)using the implicit feedback dataset to identify whether two nodes arerelated.

The implicit feedback dataset is based on gathering observations thatindicate possible relationships between nodes. When a user performs anaction using a software tool connected to the analytics server, an eventis added to the analytics server graph and passed to the filecorrelation algorithm, or corresponding algorithm if dealing withnon-file/other types of nodes. In a non-limiting example, the analyticsserver may group events together by a heuristic the analytics servercall “session groups,” which can be easily understood as a session. Allthe files and components of work that are worked on by a givenindividual within the same session are grouped together into a “sessiongroup.” An assumption is that files that appear in the same “sessiongroup” more often have a higher chance of being related. File pairs witha high chance of being related may receive a higher “relationship score”than files that do not. The output is a data frame with a preliminaryrelationship score for every file-file pair.

The analytics server may also provide explicit suggestions for files inaddition to the implicitly generated suggestions. These may include thelinked nodes of a particular nodes' other linked nodes that might havebeen shared with a user immediately before and/or afterwards by the sameperson. In another example, files that might have been sharedimmediately before and afterwards in a chat session may also bedesignated as related. Lastly, the analytics server takes this pool ofimplicit and explicit suggestions and linked files, and compares themetadata such as names, file snippet, file contents, and/or mime typesto determine whether to suggest files as versions.

In an example, the analytics server generates an initial labeling (e.g.,implicit feedback dataset) indicating that two nodes are related basedon idle time. An idle time may refer to a scenario where two nodes havesuccessive activity events by the same person within a predeterminedamount of time (e.g., less than 45 minutes). Time logging data enteredor confirmed by a user can be used to influence a relative score betweentwo nodes.

In an example, the analytics server generates an initial labelingindicating that two nodes are related based on open or used applicationson a user device. Users can open locations, files, nodes, contacts,applications, etc. within analytics server's software tools or windows(or sometimes third party application monitored the analytics server).Having these elements listed above open in the same analytics serverwindow (or external window, third party browser window) can influence arelative score between them.

In another example, the analytics server generates an initial labelingindicating that two nodes may be related based user associations. Userscan associate locations, files, nodes, contacts, applications, a givenaccount for an application, etc. with graphical user interfaces providedby the analytics server (e.g., smart windows described in FIGS. 18A-M).If both elements have a hard relationship (e.g., pinned to the window,added as a source for the window, etc.) with an analytics server window,then they are more likely to have a higher relative score (e.g., closerdistance) between them.

If elements A and B are recommended to the same smart window C (e.g.,based on soft factors) then the analytics server generates a relativescore between A and B indicating that they may be related. Othersoft/implicit factors may include being viewed in the samewindow/application.

The analytics server may also retrieve additional data associated witheach node. Non-limiting examples of data retrieved may include existingrelationships between nodes (e.g., other connected nodes). The analyticsserver may also retrieve location information, IP address, device, ID,and other data associated with different electronic devices thataccessed the files/data.

The analytics server may also consolidate activity for a given user andmay create relative scores between components of work because ofcloseness of a user's activity around them (e.g., file edit history).The types of activity events performed by users may also influence therelative score between nodes. For instance, if a user edits File A,reads File B, edits File A, reads File B, the analytics server maycreate a higher relative score indicating a higher likelihood ofrelatedness than if the user reads file A, reads file B, and reads fileC.

The analytics server may also use various scoring algorithms (that canbe revised and tuned by a system administrator) to generate a scoreindicating whether two nodes are similar. For instance, the analyticsserver may use the score to identify users who may know informationrelated to a file (e.g., workflow component). When a user accesses afile, the analytics server may recommend a user who is associated withthe file (“other users who may know about this”). The analytics servermay rank users based on their respective scores and may recommend a topportion of the ranked users (e.g., top 5 users). To generate the score,the analytics server may attribute 1.5 points per collaboration onmessages that user has sent; 1 point per collaboration on messages thatuser was in “To” field/sent slack message; and 0.5 points percollaboration on messages that user was in “CC” field/was recipient in aslack direct message. To identify and score related content, theanalytics server may assign 0.75 points per collaboration oncopies/versions that user has activity on; and 0.5 points percollaboration on copies/versions that user has no activity on.

The analytics server may automatically link nodes that correspond tofiles and workflow components in accordance with the following rules:

Shared email message: if file A and file B share an email message;

Shared email thread: if file A and file C share are in two emails withinthe same thread;

Shared chat message: if file A and link B share a chat message;

Shared message thread: if file A and file C are in two messages withinthe same thread; and

Shared task: if files A and B are attached or referenced in the sametask.

File Relationship Algorithm

The analytics server may execute a file relationship algorithm toidentify/recommend relationships among nodes. The analytics server mayleverage the history of relationships stored within your connectedlocations to build newly suggested relationships. As used herein, aconnected location may refer to an application that is integrated intothe methods and systems implemented using the analytics server. Forinstance, a third party messaging application, desktop folders, cloudserver, an organizations internal and/or external CRM software tools,may all be considered as connected locations.

Furthermore, consolidating user activity allows the analytics server toloosely suggest implicit relationships between files that are worked onat similar times. The analytics server then uses these explicit andimplicit relationships between files as a starting point over which acollaborative filtering model is executed to provide recommendations(e.g., identify related nodes).

The observational data gathered from implicit feedback may not be useddirectly to provide sophisticated recommendations, and is therefore usedprimarily as preliminary data. This is because implicit feedback mayhave certain characteristics that must be accounted for to getmeaningful suggestions. For instance, implicit datasets may not providenegative feedback without additional processing (e.g., if there is noregistered interaction between files, then the “missing data,” or thelack of a relationship between file pairs is not addressed withoutfurther processing). In another example, implicit feedback may be noisy.In another example, the relationship score between two files may onlyindicate the system's “confidence” over the existence or non-existenceof a relationship, and may not provide the actual strength of therelationship.

A visual representation of possibly related nodes is illustrated in FIG.10. In the nodal data structure 1000, the analytics server identifiessome nodes as related using explicit factors. These relationships areillustrated using solid lines. For instance, nodes 1020A-E are relatedto the node 1030. The nodes 1020A-E may represent similar nodes as thenodes 910A-E illustrated in FIG. 9. The analytics server may identifyother related nodes using explicit relationships. For instance, the node1010G is relate to node 1020B because of a similar MD5 hash value. Thenode 1010F is also related to the nodes 1010G, 1010E, and 1020D.Similarly, the nodes 1010C and 1010B are related.

The analytics server also identifies that some nodes are possiblyrelated due to implicit relationships. These relationships areillustrated using dashed lines. For instance, the analytics serveridentifies that the node 1010A-E may be related to the node 1030. Theanalytics server may also designate a likelihood of the nodes beingrelated based on the type of implicit data. For instance, the node 1010Ahas a low likelihood of being related to the node 1030 because similarpeople can edit the file that corresponds to the node 1010A. Incontrast, the file that corresponds to the node 1010D is likely relatedto the node 1030 because the file has a solid line (e.g., hard factor)relationship with the node 1010F, which has a solid line relationshipwith the node 1020D, which has another solid line relationship with thenode 1030.

Scoring/Imputing

After initial labeling, the analytics server may calculate a compiledrelative score between nodes (e.g., between files and other files,between users and other users, and items to identify whether they arerelated). As described above, the analytics server may generate animplicit feedback dataset that includes all retrieved context/sessiondata (step 1110 in FIG. 11). The analytics server may then execute ascoring algorithm to identify whether the nodes are related (step 1110in FIG. 11). The analytics server may also execute one or more computermodels configured to identify whether the nodes are related. The modelsmay employ artificial intelligence and machine learning algorithms todetermine a likelihood of relatedness for each pair of nodes using theimplicit data previously retrieved.

The AI/ML models may be trained using training datasets (e.g., groundtruth datasets) that represent known related nodes. Once trained, themodels can be executed to identify whether two nodes are related. Theanalytics server may also periodically monitor the model's outcome toretrain the models based on identifying false positive and revisingvarious algorithms utilized by the models.

The analytics server may generate a score matrix based on the implicitdataset. For instance, the analytics server may generate anelement-element co-occurrence matrix by aggregating over all sessionsand calculating: X_ij:=(Count of distinct sessions where both element iand element j had events).

In this sparse symmetric matrix, X is factored into UV′ using anappropriate numerical method, such as Alternating Least Squares providedby implicit package, although other algorithms may also be used. Thenumber of columns for U may correspond to a tuning parameter chosen byk-fold cross validation. The analytics server may then apply aRegularization algorithm (e.g., alpha is also chosen by k-fold crossvalidation). The number of iterations is a nuisance parameter. In someconfigurations, the analytics server will warm start the factorizationupdates.

The analytics server may employ the following method to achieve theabove-described results:

Step 1: Related files are determined by scoring an element;

Step 2: File Detail view makes request to recommender micro service;

Step 3: Identify element i for a given file;

Step 4: Calculate predicted scores of X_i* by multiplying U_i×U;

Step 5: Filter out all element pairs, which are explicitly linkedelsewhere in the graph;

Step 6: Take top 20 remaining elements;

Step 7: Filter out all elements scoring below threshold \tau (where \taudetermined via cross-validation);

Step 8: Return elements that remain as JSON;

Step 9: Mapped back to files by Web server.

In that embodiment, the analytics server may generate groupings of filesthat were worked on by the same individual during similar times. Thesegroupings are called “session groups” (e.g., context data associatedwith the event) and are created by querying the database and/or thenodal data structure for activity events recorded in timelines for eachfile. Different types of events over files can have different relativeweights corresponding to how important they might be within a given“session groups.” For example, receiving a file during a given worksession will have less impact to an “session groups”, than viewing afile during that same session, which in turn has less impact thanediting a file during a given session. The matrix is then factored intoX=UV′, where U and V are n by k matrices and k and λ factors areselected via cross validation methodologies. When the analytics servergenerates a score for file i, scores are calculated (e.g., UiV), sorteddescending, and filtered based on a threshold.

As previously described, the implicit dataset may be noisy, which maylead to incorrect results. To address this issue, the analytics servermay various curve-smoothing methods. For instance, the analytics servermay use the collaborative filtering technique described in the paper“Collaborative Filtering for Implicit Feedback” (Hu et al., 2008). Thistechnique relies on using Alternating Least Squares to minimize a lossfunction. Alternating Least Squares method exploits the algebraicproperties of the loss function to minimize it in linear time. Smoothingin this way generates recommendations that are more accurate given thenature of implicit feedback datasets. In some other embodiments, theanalytics server may utilize a stochastic gradient descent method.

To improve the scoring process, the analytics server may also includeexplicit relationships among nodes into the model before smoothing thedata points. In some configurations, the analytics server may executethe model to generate two sets of recommendations, based on twodifferent scores (one from explicit suggestions, and the other fromsmoothed implicit data points). The analytics server may combine thesescores and generate only one set of recommendations that take bothimplicit and explicit data into account.

In some configurations, additional factors can be taken intoconsideration in scoring the relationships between nodes. For example,the analytics server may add weights to different types of events thatare used in creating implicit relationships between nodes. This assumesthat certain activities are stronger indicators of relationships thanothers are. The analytics server could also use types of events betweennodes to suggest types of relationships. For instance, the analyticsserver can adapt an approach that also analyzes other metadata, such asemail content surrounding the files, common collaborators between files,common folders that contain both files, and the like. The analyticsserver may compare text-related files using low-dimensional vectorrepresentations of them. Similarly, the analytics server may calculatesemantic similarity between components of work (i.e. documents, files,messages, tasks, etc.) to find text-based similarities, which mayindicate that two components or work are similar and/or related.

Labels and Scores Change Over Time

Referring back to FIG. 8, at step 830, the analytics server mayperiodically reclassify and relabel nodes to achieve better results. Theanalytics server improves the quality of initial labeling and the AI/MLpowered scoring by considering user behavior. For example, the analyticsserver may provide an option for users to accept and/or rejectrecommendations and have their actions influence the earlier labels andmodels described herein. For instance, as a user is interacting with afile/workflow component, the analytics server may generate a list ofpossibly related files/workflow components. The analytics server maythen prompt the user to identify whether the suggested files/workflowcomponents are indeed related to the user's project/files. Users may beable easily verify whether a suggestion is a good suggestion by eitheraccepting it or dismissing the suggestion. These user responses willupdate the score between the two nodes. A naive implementation thatcould be used as a starting point would divide the current score betweenthe two files in half. Furthermore, validations and/or rejections ofsuggestions should be used as a training set to improve recommendationgenerated by the analytics server.

In some configurations, if a user dismisses a recommendation, theanalytics server does not revert the score identifying the relationshipbetween the two corresponding nodes to zero. Instead, the analyticsserver may adjust the score, such that other considerations are takeninto effect. For instance, user 1 and user 2 have local access to file Aand file B. They are not shared files, so both users each have 1 privatecopy of each file. If user 1 accepts a relationship between file A andfile B, and user B rejects the relationship between file A and file B,the analytics server could use both inputs in calculating a new score.The factors that could potentially influence a relationship betweenfiles A and B do not have to result in a binary output of options.Furthermore, having an understanding of all of the factors and metadatasurrounding a conscious acceptance/rejection, a passively browsing clickto view the file, the context in which a file was viewed, by whom, inwhat location, on what device, etc. . . . can help personalizerecommendations contextually by user or by project. There does not needto be a single score between nodes (e.g., there can be a single scorebetween nodes per user).

An example of suggested related files is depicted in FIGS. 26-27. GUIS2600 and 2700 are similar. However, for clarity, GUI 2700 focuses onspecific GUI elements related to methods and systems described herein.

In the depicted embodiment, the user uses the graphical element 2706search and browse files within various data repositories and/or softwaretools. The analytics server may display the requested files in thegraphical element 2602. The analytics server may then display thegraphical element 2702, which suggests possibly related files and/orworkflow components. The analytics server may also display the graphicalelement 2704, which displays the requested files and/or other versionsof the requested files that may be stored within one or more datarepositories and/or software tools.

The analytics server may also identify subsequent and/or previousversions of the same file/workflow component. For instance, theanalytics server may identify closest nodes to any given node andprocess the information again with new/alternate algorithms that canestablish particular types of relationships between the nodes. Forexample, analytics server may compare the text in the contents of thissubset of nodes (e.g., nodes that are more similar or closer indistance) more easily than comparing text in all of the existing nodes.This method also requires less computing power because fewer nodes areanalyzed. If text is similar enough (e.g., likelihood of similaritysatisfies a threshold), the analytics server suggest two nodes asversions of each other. Specific types of actions and relationships canalso start to inform classifications. For instance, two files withsimilar names that share an email thread may likely be versions of eachother.

In order to increase efficiency and execute more detailed analysis, theanalytics server may identify a subset of the nodes and execute variousadditional analytical protocols. The additional protocols may includeprotocols that extract and identify similarities between contentcorresponding to the subset of the nodes. For instance, where in someembodiments, the analytics server compares metadata corresponding to afile or node, in some other embodiments, if there is only a subset ofnodes, the analytics server could do deeper (more detailed) processingand comparison of the contents of the nodes. The analytics server mayexecute additional models/algorithms that may be too costly orinefficient to run on all possible combinations/permutations of allnodes within the nodal data structure.

In some configurations, the analytics server may allow users to definetheir own types of relationships, association, and other pertinentinformation. In this way, the analytics server may provide users withthe freedom to label relationships with any label they choose. As theamount of user-defined, implicit, and other explicit relationshipsincreases, the analytics server (using recommendation algorithms) willbe able to provide better suggestions that could include specificrelationships types. Non-limiting examples of these relationships mayinclude, files that are derivatives of other files (e.g., working filesor parent-child relationships between files), files that are outputs orexported from other files (e.g., a pdf file and the associated worddocument that generated the pdf), files that are previews of other files(e.g., an image of a 3D model and the associate CAD file), emails thatare related to the same deal, etc.

Referring now to FIG. 12, a nodal data structure linking nodes based onthe methods/systems described herein is illustrated, in accordance withan embodiment. The node 1210 is linked with various other nodes usinghard/soft factors and the methods/systems described herein. Theanalytics server links the nodes 1220A-E to the node 1210 based onexplicit data, illustrated as solid lines. The analytics server thenlinks the nodes 1230A-F to the node 1210 as related nodes identifiedusing implicit data, illustrated in dashed nodes.

The methods and system described herein can be used for an informationmapping business that helps users improve their access to relevant andunderstanding of fragmented work data. The methods and systems describedherein can be used in conjunction with a variety of software tools. Thevalue of these methods and systems, regardless of how the technology ispackaged, lies in helping users automatically and/or manuallyinterrelate units of work, while also contextualizing them within thelarger objectives/contexts, such as, teams, projects, clients, classes,etc.

The analytics server allows users to increase their ability to workcontextually without being required to provide the analytics serveraccess to any of their underlying data. However, users must share somecontext data with the analytics server in order allow the analyticsserver to map data and identify related nodes. As a result, the methodsand systems described herein can be implemented on any data repositorywithout requiring a change within the existing infrastructure. Forinstance, the analytics server may be utilized within an organizationwhere the analytics server can work in conjunction with existinginfrastructure and software tools to contextualize data.

Moreover, users can also use analytics server's information mappingservices as native interface components, for example through an API(FIG. 28). The interface shown in FIG. 28 may be integrated into anyother application, including a third party application, therebyeliminating the need to use multiple applications (e.g., existingapplications of an organization and an additional application to viewdata provided by the analytics server).

The methods and systems described herein can also solve long-standingdata fragmentation problems. FIGS. 22A-B show a conventional method ofoperation of conventional software solutions. For instance, as depictedin FIG. 22A, users typically have a fragmented approach to accessingunits of work (e.g., files, messages, tasks, etc.) and digital workflowsare organized by applications rather than by objective. As a result,users experience a disconnect between intent and their interaction withwork inside the digital tools they use, ultimately making it difficultand unnatural to access information, particularly for people who have donot have previous experiences with the organization of a given projector objective.

FIG. 22A illustrates the current paradigms for how users interact withapplications and their data. The term “ecosystem,” as used herein, mayrefer to a software ecosystem or an operating system prevising services.The ecosystem may be interpreted as the World Wide Web (the browserthrough which the web is accessed) and may include all the differentapplications that run on the web. Different application and theirecosystem may share some data/processes/functions within otherapplications in their ecosystem. For example, sharing a file withsomeone through data repository (e.g., Google Drive®) and sending anemail to someone else through an electronic mail messaging application(e.g., Gmail®) may add both of those individuals to a single contactlist in a suite of applications (e.g., G-Suite®) because bothapplications are part of the same ecosystem. Under most circumstances,contacts across ecosystems remain unlinked. Because most ecosystems arenot designed to communicate proprietary data, users using multiplesoftware applications belonging to different ecosystems may not benefitfrom interchange of data among the used software applications.

Most workplaces and individuals today, live with digital workflows thatleverage software from multiple ecosystems, which results in a lot offragmentation, duplicate efforts, and time spent searching for orunderstanding information, increased onboarding time of new employees orproject team members, and the like. For example, it is not uncommon fora modern knowledge worker to communicate and collaborate simultaneouslyacross several software applications on any given day: internalmessaging via a chat tool (like Slack), external messaging via an emailtool like outlook, respond to comments and conversations in documents(like Google Docs or Quip), update team members with comments over agiven task or support ticket, etc.

The analytics server may identify related information for a givencontext, in essence using the context as search criteria for all theunderlying information within each application. The analytics server mayalso generate suggestions on nodes and information that may be relevantto a given context so a user may access each piece of information moreefficiently. Furthermore, as illustrated in FIG. 22B, the method andsystems described herein aggregate all electronic files and theirrespective context data (other associated information, such as relatedfiles, communications between users associated with each electronicfile, and the like) in one place; All electronic messages and theirrespective context data (other associated information such as thesender, related communications, related nodes, and the like); Allelectronic tasks and their respective context data; All calendar eventsand their respective context data (related files, communications, etc.);and All contacts and their respective context data (other associatedinformation like role, title, relationship with other people, relatedfiles, messages, etc.); and all notifications.

FIGS. 22-25 illustrate how this fragmentation typically manifests itselfin a given user's workflow. FIG. 23 shows how an account manager orsales development representative who works with multiple clients has toconstantly communicate with both internal and external stakeholders inorder to help manage communication and successful project delivery. Theaccount managers and/or sales representatives may be responsible forunderstanding and representing the client's needs internally withintheir agency or client service firm. Simultaneously, these sameindividuals need to understand their agency's capabilities andavailability in order to keep the client informed of current projectstatus as well as to help sell additional services to the client. Thedifficulties start to compound when each person within the agency endsup working on multiple projects for different clients and theinformation required to understand who is working on what, what thecurrent status of a project is, and why things were done in a particularway is fragmented across channels.

Knowledge workers that work with multiple clients experience thisproblem more acutely than others might as is shown in FIG. 24. Asdepicted, each account manager ultimately has to use his/her internalcompany tools, as well as interface with a variety of tools that theirclients use. This results in more extreme fragmentation, makes it easyfor information to get lost, and users end up experiencing communicationand collaboration problems. It frequently manifests with workers usingmultiple browsers for different client accounts and an overwhelmingnumber of tabs simultaneously open to try to keep track of theirworkflows as can be seen in FIG. 25A.

FIG. 25B illustrates an alternate approach that can serve as a solutionto the above-described problems. Instead of requiring that userinteraction with digital work be siloed by application or ecosystem,these figures describe how data that exists within each application canbe restructured and displayed in ways that are more functional to theend user, and that are application agnostic. This alternate approach canbe broadly described in two parts.

The first part is the consolidation of units of work across applicationsand their organization by the type of unit it is (e.g., files, messages,notes, contacts, and tasks) across all of user's work. In order to helpa user to respond to and manage this fragmented communication, thealternate approach offers a solution that consolidates all of thosecommunications in one message client rather than across applications(FIG. 29). Similarly, that user may desire to have all his/her emailattachments, local files, cloud storage files, files attached in projectmanagement tools, etc. accessible through one file browser; all hercontacts consolidated in one contact list; all her notes browsablethrough one interface; all her calendar events consolidated in onedatabase and interface.

The methods and systems described herein will improve a user's abilityto manage his or her workflow across tools, applications, websites, andaccounts. Furthermore, consolidating all of this information providesthe analytics engine with the opportunity to provide data enrichment andcontextualization to the end user around their existing work, asdescribed elsewhere in this patent.

One of the problems with employee directories and organizational chartsat large organizations is that they quickly become outdated. Thisproblem is exasperated if no employee(s) is dedicated to keeping theorganization charts current and updated. Using the methods/systemdescribed herein, the analytics server is able to automatically analyzeand consolidate multiple profiles and other employee information fromseveral systems (internal and/or third party). The analytics server canalso aggregate and present user profiles and organization charts in aneasy-to-understand graphical user interface. Users are able tounderstand the relationships between people and other people, people andcontexts (what projects, clients, and objectives colleagues have beeninvolved), and people and Components of Work.

The analytics server is also able to utilize the systems/methodsdescribed herein to automatically identify and rank an employee'scolleagues. For instance, the analytics server may use themethods/systems described herein to identify related nodes correspondingto user behavior, interactions, and communication. The analytics servermay monitor and analyze files and/or workflow components interacted byeach employee (e.g., accessed, edited, and/or deleted) to generate a defacto organization chart. For instance, the analytics server may analyzethe number of shared, overlapping meetings, files, messages, tasks, andsimilar workflow components associated with each employee to generateand/or update the organization chart. The analytics server may alsoaugment the above-described analytical protocols by insights gleanedfrom the consolidated activity, such as the employee's average responsetime to messages or other activity events from a given colleague.

With an overall understanding of all employees and their workflow, theanalytics-server-implemented method begins to identify clusters ofemployees and types of relationships between people that a user caneasily confirm or correct. For example, product teams that share a lotof content and messages can automatically be grouped into a cluster andsuggested as a team. In another example, the analytics server mayidentify communication and/or file interaction patterns between anemployee and his/her manager because they typically have a lot of sharedcontent (e.g., information and/or files), frequent messages, andregular/standing meetings/checkups every week, and the like.

By having a consolidated understanding of employee activity acrossdifferent systems/platform and having an understanding of therelationships between users, the analytics server is able to identifyand flag a particular user who has communication pattern changes. Forinstance, the analytics server is able to recognize and ask for updatedinformation if/when an employee changes teams or starts reporting to adifferent person within an organization.

The analytics server may automatically and periodically identify theabove-described patterns and may automatically and periodically identifya change in the organization's hierarchy based on employees' behavioralpatterns. Therefore, the analytics server may identify and revise theorganization chart when an employee's role is changed, even if theemployee's official title has not changed.

FIG. 13A illustrates a flow diagram of a process executed in anelectronic workflow management system, in accordance with an embodiment.The method 1300 includes steps 1302-1314. However, other embodiments mayinclude additional or alternative execution steps, or may omit one ormore steps altogether. In some embodiments, the method 1300 may beexecuted on various other workflow components, such as tasks, messages,notification, and the like.

In addition, the method 1300 is described as being executed by a server,similar to the analytics server described throughout this disclosure.However, the described steps may be executed by any number of computingdevices operating in the distributed computing system described inFIG. 1. For instance, part or all the steps described in FIG. 13A, maybe locally performed by one or more user computing devices or anadministrator-computing device. Furthermore, even though some aspects ofthe method 1300 is described in the context of a web-based application,in other configurations, the analytics server may display related datain a mobile application or an application native to the user's desktop.

At step 1302, the analytics server may periodically scan a plurality ofelectronic data repositories accessible to a plurality of computingdevices to identify a plurality of files stored onto the plurality ofelectronic data repositories where each file is accessible to at leastone computing device within the plurality of computing devices. Asdescribe above, the analytics server may use various protocols toidentify various files stored onto data repositories accessible (oraccessed) by one or more computers within a network. Furthermore, asdescribed above the analytics server may use various APIs, web books,and the like to identify the files accessible and/or accessed by theusers within the network.

At step 1304, the analytics server may execute a predetermined protocolto generate at least one unique identifier of each file within theplurality of files. At step 1306, the analytics server retrieves foreach file, context data comprising at least one of a time stamp andaccess/edit history. At step 1308, the analytics server executes acomputer model to identify related files based on each file's contextdata and unique identifier to generate a plurality of groups where eachgroup comprises at least one electronic file.

As described above, the analytics server may use one or more analyticalprotocols to identify related data. For instance, in some embodiments,the analytics server may use the unique identifier generation method(also referred to herein as the hard factors) to identify whether twofiles are related. Additionally or alternatively, the analytics servermay execute various analytical protocols and/or artificial intelligencemodeling techniques (also referred to herein as the soft factors) toidentify whether one or more files are related. In some configurations,the analytics server may use a combination of the hard and soft factorsto identify related files and workflow components.

As described above, analytics may generate and periodically update anodal data structure where each node represents a unit of workflow, suchas a file, message, a user, or any other content related to workflow.The nodal data structure may include interrelated nodes that areidentified as related, using methods and systems described herein.

The analytics server may also generate various groups/clusters of nodeswhere each node within a group/cluster is related to other nodes. Forinstance, a group of nodes may correspond to nodes associated with aproject or a client. The analytics server may group nodes based onvarious attributes. Therefore, a group of nodes may represent any groupof nodes that correspond to workflow components that share at least oneattribute.

At step 1310, the analytics server monitors electronic communicationbetween a set of users to identify a set of electronic communicationsbetween at least two users, where each electronic communication isassociated with at least one group of files. The analytics server maymonitor all electronic communication between users where they electroniccommunication involves at least one node within at least one group.Electronic communications may involve any identifiable communicationusing organization servers and/or the analytics server.

At step 1312, the analytics server identifies context data associatedwith each identified electronic communication event, the context datacomprising at least a time stamp of each electronic communication eventand an electronic file and its corresponding group. The analytics servermay monitor timestamps associated with each communication event. Forinstance, the analytics server may monitor a “response time” associatedwith each communication event. The response time may correspond to atime period between when an electronic message was received (or seen)and when a response was sent back.

The analytics server may consider other data associated with theusers/nodes to identify relationships among users. Things considered bythe analytics server may include number of shared files, messages,tasks, calendar events, and the like within two users within apredetermined time. In another example, the analytics server mayconsider activity events across software tools. For instance, theanalytics server may consider the type of activity events (e.g., view oredit). If events are one sided, it could inform type and direction ofrelationship (e.g., someone is always assigning tasks to someoneelse)—it could also inform who else reports to the same manager andtherefore can create ‘sibling’ relationships between people. In anotherexample, if one user always edits the other users work product, defensemay indicate that the editing user may have a higher score within theemployee organization chart. In some embodiments, deviations inpatterns/relationships can come from changes not between two people, butchanges between a group. Where if a manager stops assigning tasks to oneuser in the “family,” the whole family might need to be re-evaluated.

The analytics server may also consider data retrieved from third partytools in order to generate relationships between users. For instance,the analytics server may initially retrieve relationships from anexisting organizational chart from an internal and/or external HRsystem.

The analytics may also consider information and manual inputs fromusers. For instance, the analytics server may retrieve labels manuallyentered by different users (e.g., on their social media or otherwebsites) and/or confirmation/rejection of previous recommendations. Ifthe analytics server identifies a possibly new relationship, theanalytics server may prompt the user to enter the user's relationshipwith one or more other users within an organization. The analyticsserver may retrieve roles, titles, job description, and other relevantinformation as indicators of a user's position within an organizationalchart.

The analytics server may also consider Contexts/smart windows that maybe centered on teams and that have information relating to team dynamicsin them. As will be described below, the electronic platform provided bythe analytics server may include various contexts specific windows thatare customized based upon users and/or projects. The analytics servermay use data customized by each user to identify relationships amongusers. For instance, if a user generates smart windows for other users,the analytics server may assume that the user who has generated thesmart windows is a manager of the other users.

The context data may also refer to the language used within theelectronic communication. For instance, the analytics server may usevarious natural language processing protocols to identify sentimentassociated with each electronic medication. Sentiment, as used herein,may include tone of the electronic communication event. For instance,the analytics server may execute a sentiment analysis protocol that usesartificial intelligence and/or machine learning to identify sentiment ofelectronic communications between two users based on the vocabulary usedin those electronic communication sessions. Understanding the sentimentof an electronic communication session may allow the analytics server toidentify a relationship between the users.

At step 1314, the analytics server may generate a score for each userwithin the set of users, the score corresponding to the identifiedcontext data of the identified electronic communication events. Theanalytics server may generate a score for each user based onpredetermined rules. The score may correspond to an importance level, ahierarchical relationship, or otherwise labeled/classified relationship(e.g., client < > consultant, friends, secret lovers, etc.) between theusers. For instance, in some configurations, the score may correspond toa response time between two users. Additionally or alternatively, thescore may also correspond to the identified sentiment value.

In a non-limiting example, the analytics server monitors employee A andhis electronic correspondence with other employees. The analytics serveridentifies that employee A predominantly receives electroniccommunications (e.g., emails) from employee B, C, and D. The analyticsserver identifies that employee A's response time to employee C is 4minutes. In contrast, employee A's response time to employee B and D is18 minutes. Similarly, the analytics server identifies that employee A'scommunications with employee B and D correspond to a much friendliersentiment than his communications with employee C. Using the abovedescribed information, the analytics server may generate a higher scorefor employee C than employee B or D. Using the above describedinformation, the analytics server may conclude that employee A mayreport to employee C. Therefore, employee C may be higher in anorganization chart than employee A, B, or D.

In some embodiments, the analytics server may generate a second nodaldata structure comprising a set of nodes where each node corresponds toan employee and their respective score. The analytics server may thenarrange the nodal data structure according to each employee's score. Ina non-limiting example, as depicted in FIG. 13B, the analytics servermay arrange different employees based on their score. The depicted nodaldata structure 1301 may correspond to a hierarchy among the depictedemployee 1318A-J. For instance, employee 1318A has a higher score thanall other employees. Similarly, employees 1318B and 1318E have higherscores than employees 1318C-D and 1318F-J.

Using the methods/systems described herein, the analytics server canalso establish patterns in the relationships between two given employeesand recognize any potential changes in the interaction patterns,regardless of how unique the relationship between two individuals mightbe. The analytics server may generate insightful graphs representing anorganization, including but not limited to, generating and/orautomatically maintaining a de facto organizational chart. For instance,employee A may be a manager of employee B. The employees AB usually meetat 8 am on Monday. The two employees are have a lot of regularcommunication, overlapping calendar events, files, etc. However,employee B then changes projects and therefore his/her manager mightchange.

In another example, employee A may be officially designated to bereporting to employee B. However, based on their communications (e.g.,how fast employee A responds to employee B and other employees), sharedprojects, and working on different files that are related, the analyticsserver may identify that employee A reports to employee C instead. Theanalytics server may continuously/periodically revise the dynamicorganizational chart as more relationships are identified or newemployee relationships are created (e.g., employee A is now assigned toa new team overseen by employee D). The analytics server may alsoprovide searching capabilities to efficiently search foremployees/resources based on this dynamic chart.

Conventionally, manual revisions of the organization chart is required.However, if an administrator has not updated the organization chart, theanalytics server (or anyone else within the organization) will benotified on this change. The analytics server can automatically identifythat employee B and employee A's relationship has changed: both gotaccess to different files and content, started working in differentfolders, no longer have regular meetings, there is fewer communicationevent, and the like. The analytics server may then prompt each user (ora system administrator) to update the information and even begin tosuggest whom the new manager-employee relationship might be. Theanalytics server will continuously revise the dynamic organizationalchart as more relationships are identified or new employee relationshipsare created (e.g., employee A is now assigned to a new team overseen byemployee D).

At step 1380, the analytics server may generate a graphical userinterface having a set of graphical component each representing a userwithin the set of users, wherein the set of graphical components arearranged in accordance relationship between each user represented byeach respective graphical component.

The analytics server may generate a graphical user interface thatdisplays a hierarchy of the employees within an organization. Theanalytics server may use the nodal data structure 1301 to arrange theemployees in accordance with its identified hierarchy.

Referring now to FIG. 13C, a non-limiting example of a graphical userinterface illustrating an organizational hierarchy is shown, inaccordance with an embodiment. GUI 1320 includes a list of all employeesand their corresponding image and contact information. A userinteracting with the GUI 1320 may filter the employees based on variousprovided filters, depicted in the graphical element 1322. For instance,the user may arrange the employees by department, office, role, team,project, and/or skill. When prompted by the user to rearrange theemployees based on an attribute, the analytics server may identify arelationship among users' nodes within the nodal data structure (basedon various analytical protocols described herein) and may arrange thevisualization of employees accordingly.

When a user interacting with the GUI 1320 interacts with an employeedisplayed on the GUI 1320, the analytics server may display detaileddata regarding that employee. For instance, when a user interacts withthe graphical component 1324, the analytics server directs the user toGUI 1326 (FIG. 13D) where more detailed data associated with theselected employee is displayed. GUI 1326 may include graphicalcomponents 1328 where the analytics server may display various resultsof the analytical protocols described herein. For instance, when a userinteracts with the graphical element 1330, the analytics server mayidentify nodes related Miglena Tadic (using the nodal data structure)and may direct the user to any related project/workflow. The analyticsserver may display data associated with nodes related to a node thatcorresponds to Miglena.

The analytics server may automatically generate profiles for eachusers/colleagues/collaborators by consolidating profile information fromany integrated third party tools. For example, the depicted user mayhave various third party software tools that are depicted in thegraphical component 1332. The analytics server may consolidateinformation from all of these tools around a profile that isautomatically generated and updated.

The analytics server may display a logo for a variety of third partyservices used by each employee, contact, or colleague. Therefore, eitherthe user viewing the GUI 1326, Miglena Tadic, and/or a third party haveconnected/integrated the illustrated accounts and that the analyticsserver has identified shared content between an email that the viewerhas confirmed/verified as his/her and an email that has been recognized(using the methods and systems described herein) as being related toMiglena Tadic. As depicted, the analytics server may also display otheremployees to whom Miglena reports.

The analytics server may use the method 1300 to generate and maintainorganizational charts for human resources; to document project teams; toprovide other human capital analytics/classifiers (early bird, late owl,social connector, and flag communication breakdowns), and the like. Eventhough, embodiments describing the method 1300 focus on a specific useof this process around people and organizational charts, the underlyingprinciples are very similar regardless of whether the method 1300 isapplied to files, employees, or any other workflow component (e.g.,tasks). For example, the method 1300 can be applied to other workflowcomponents and/or files to identify a hierarchy between the workflowcomponents and/or files.

Using the method 1300, the analytics server can also classify orrecommend classification types between the relationships. For example,the analytics server can analyze known relationships between two files(e.g., shared email message, shared email thread, shared task, similarfile contents, and/or files with similar activity by a user) andinfer/recommend relationships that may be more useful to users (e.g.,versions of each other, one is a derivate of the other, and/or relatedto a particular deal or project). As stated throughout, the method 1300may also be applicable to any other workflow component, such as nodescorresponding to people and their contact.

In other related embodiments, the analytics server may identify and rankemployee contacts as well. Employees' contacts typically fall out ofdate and lack contextual information. Employee directories requirededicated people to maintain, and even then, they fall out of date.Organizational charts require dedicated people to maintain, and eventhen, they fall out of date. Employees usually struggle to quicklyreceive and respond to the most important contact updates (e.g.,information, updates and action items) they receive across the manytools and accounts they work with. The analytics server can rankdifferent contacts.

The analytics server can retrieve a list of email addresses associatedwith the users. Each email address may have a list of relatedintegrations (on which it has permissions); consolidated list ofactivity events from the email address' related integrations;consolidated list of files the email address' integrations havepermissions on; consolidated list of messages the email address'integrations have permissions on; and consolidated list of tasks theemail address' integrations have permissions on.

The analytics server can also retrieve and display a list of users whereeach user has a list of related integrations (e.g., applications and/orworkflow components on which he/she has permissions), consolidated listof activity events from the user's related integrations and from theuser, consolidated list of files the user's integrations havepermissions on, consolidated list of messages the user's integrationshave permissions on, and consolidated list of tasks that have been havebeen created by or assigned to the user's integrations. The analyticsserver can retrieve and display a list of contacts associated with eachuser. Each retrieved contact may have a list of related email addresses,and through those integrations, the analytics server may identify newconnections.

The analytics server may also consolidate profile information from thecontact's related integrations. This is sometimes referred to asdeduping (or de-duplicating) nodes. For instance, the analytics servermay de-duplicate files with similar MD5 identifies and merge theirrespective metadata from different sources. The analytics server maythen execute various analytical protocols described herein to establishrelationships between users and label those relationships.

The analytics server may also use various profile data to integrate andidentify relationships among users. For instance, each user profile mayhave special fields, such as person to person relationships (e.g., Slackprofiles can have the following fields: mentor (user), birthday (date),title (string), etc. and a different company can have different fields),consolidated list of activity events from the contact's relatedintegrations, consolidated list of files the contact's integrations havepermissions on (e.g., the files, messages, etc. need to be able to beaccessed by the logins/integrations of both the 1st person user and thatuser's contact), consolidated list of messages the contact'sintegrations have permissions on, and consolidated list of tasks thecontact's integrations have permissions on. Each contact can also haveother related components of work, including files, messages, tasks,notes, etc. that can be established in much the same way asrelationships and labels between files are established. Similarly,contacts are able to have relationships with other contacts and/orusers.

The analytics server may allow users to share accounts for givenintegrations (e.g., multiple people can access a central email such asinfo@company.com). In this embodiment, not all activity from sharedintegrations may pollute the activity timelines for contacts that haveaccess to a shared account/integration. In other words, contact'sprofile does not show all activity from shared integrations, but it canshow activity that is identified as originated from user within sharedintegrations.

Using the methods and systems described herein, the analytics server cancreate a relative score between a user and all of his/her contacts(e.g., create a measure of relative distance between a user andcontact). Analytics server may provide a list, which allows users tofocus information around the people/contacts that matter most to them.The analytics server may establish labels between users (e.g., managerand manage and/or a user and a contact). Using the methods and systemsdescribed herein, analytics server may identify when an organizationalchart needs to be updated.

The analytics server may send a notification to multiple users tosuggest/recommend relationships and labels for the relationships (e.g.,manager, mentor, client, etc.) between different employees and/oremployees and their contacts. For instance, if the analytics serveridentifies that users A and B have high scores in relation to user C,the analytics server may recommend contacting user B when user C iscontacting user A. In a non-limiting example, when user C is drafting anemail to user B, the analytics server displays a prompt on user C'scomputer that recommends drafting a similar email to user A or includinguser A in the email as a recipient.

The relationships generated by the analytics server are dynamic and theanalytics server may revise them based on monitored user activity andbehavior. For instance, a change in a user's activity may indicate thata label (or lack of label) may have changed between a user and contact(or a cluster of contacts). The analytics server may revise theserelationship (e.g., relative score) based on monitored user behavior ofthe user, related users, and/or the contacts.

The analytics server may also augment suggesting relationships byidentifying a behavior pattern of the contact, such as the contact'sdepartment, related teams, and related projects. The analytics servermay also recommend when users/contacts are working (e.g., early bird,late owl, and prefers slack to email) or add labels to contacts toimprove collaboration.

Using the methods and systems described herein, the analytics server mayuse communication and activity patterns to also identify a relationshipbetween two users/employees. The relationships may not be limited toidentifying organizational hierarchy. For instance, the analytics servermay also identify social relationships among employees. The analyticsserver may create paradigms of relationships that it can try to matchbetween users, such as manager-employee, close friends, social conflict,etc. This can be used to improve operational performance, staffing,organizational transparency and lateral communication, etc.

The analytics server may also provide searching capabilities toefficiently search for employees/resources based on the dynamicorganizational chart. Using the nodal data structure representing theorganizational chart, the analytics server may receive a query (e.g.,“show me employees who work on project X”) and traverse the nodal datastructure accordingly to identify related nodes. For instance, even ifemployee A has not been officially designated as working on project X,the analytics server may identify employee A as a possibly working onproject X using the methods and systems described herein.

Conventionally, information shown in project profiles may not typicallyexist in one place. As a result, information associated with aparticular project may not be easily accessible to employees. Theanalytics server utilizes the nodal data structure (e.g., disclosedsystem of pointers that is building/presenting known and suggestedrelationships between components/nodes of information that reside withindifferent (sometimes third party tools) to identify a project associatedwith an employee's worked hours. End users are able to navigate theiractivity timelines and see relevant activity events automaticallycorrelated with different suggested projects. Ultimately, as usersaccept or submit their timesheets, they are increasing the analyticsserver's confidence that a given activity event and therefore relatedcomponent of work may be related to a given project. This helps theanalytics server populate project profiles with relevant information. Asdescribed and depicted in FIGS. 14-15A-D, the analytics server mayutilize the methods and systems described herein (e.g., nodal datastructure) to automatically recommend certain projects or clients forblocks of time depending on the underlying relationships betweenfiles/components of work, and certain projects.

The method 1400 uses the relationship-building methods described hereinto augment time entry applications. For instance, the analytics servercan use related files and users (e.g., how other users working on arelated file have documented their time spent on a project) to recommenda best option for time worked by users. In a non-limiting example,analytics server can recommend how much time a user should bill to aproject. The analytics server can also consolidate user timelines andtime spent on different projects. By consolidating the data, analyticsserver can also allow structured and unstructured searching (e.g.,e-discovery). For instance, when instructed, the analytics server mayretrieve all times billed to a particular project and identifycorresponding nodes within the nodal data structure. As a result, theanalytics server may retrieve all relevant files to a project (e.g., allfiles accessed by employees while working on a project).

Using the method 1400, the analytics server can provide time trackingsuggestions that attribute (or suggest) a user consolidated timeline ofactivity events to a particular project.

The analytics server may retrieve data from the existing graph (e.g.,nodal data structure) and relationship recommendations to suggest whatevents are related to which projects. The analytics server may also useother users/employees activities to identify a related project. Forinstance, if multiple users are related to the same activity or event(e.g., email sent/received, and calendar event) and one user classifieshis/her time as related to one project, the analytics server mayconclude with a certain level of confidence that other users (working onrelated material) were also working on that particular project duringthe same time. Using this method, the analytics server may improve therecommendation for what project that time slot is associated with forthe second user. This method may use information from third party timetracking tools to help increase/decrease the relative recommendationscore between activity events and their related files, messages, tasks,etc. to projects.

The analytics server may also provide users the option to classifycertain times and groups of activity/events as different projects and/orincrease/decrease the relative recommendation score between theunderlying information to the respective projects. The analyticsserver's recommendations may improve depending on a number of users whohave mapped (e.g., classified) different events. For instance, when apredetermined number of users map the same event as related to aproject, the analytics server can identify that the event and theproject are related with a high degree of confidence. Therefore, theanalytics server's recommendation improves with each mapping of eventsinputted by each user.

FIG. 14 illustrates operational steps for identifying a time periodassociated with one or more electronic files. At step 1410, theanalytics server may periodically scan a plurality of electronic datarepositories accessible to a plurality of computing devices to identifya plurality of files stored onto the plurality of electronic datarepositories where each file is accessible to at least one computingdevice within the plurality of computing devices. At step 1420, theanalytics server may execute a predetermined protocol to generate atleast one unique identifier of each file within the plurality of files.At step 1430, the analytics server may retrieve, for each file, contextdata comprising at least one of a time stamp and access/edit history. Atstep 1440, the analytics server may execute a computer model to identifyrelated files based on each file's context data and unique identifier togenerate a plurality of groups where each group comprises at least oneelectronic file.

At step 1450, the analytics server may, in response to receiving a timeinput from a user, identify a group associated with the user during thetime period inputted. The analytics server may receive a request from auser device to identify a group associated with an inputted time. In anon-limiting example, a user may select a timeframe representing a timeperiod of work hours. For instance, the user may access an electronictime entry form/software tool and may select a time. As describedherein, the analytics server may periodically scan (or use othersoftware tools described herein) to identify files/workflow componentsaccessible and/or accessed by each user within a network. The analyticsserver may then generate a nodal data structure and identifyinterrelated nodes using the methods/systems used herein.

As described herein, the analytics server may use various analyticalprotocols to identify and link related nodes. Therefore, the analyticsserver may generate clusters/groups of nodes within the nodal datastructure. The analytics server may cluster nodes based on one or moreattributes. For instance, each cluster of nodes may represent a projectdefined by a system administrator.

After receiving a request from a user along with an identified timeperiod, the analytics server may first identify components of workaccessed by the user operating the user device within theselected/inputted time. For instance, if the user indicates that theuser is interested in a time window corresponding to 3-5 PM onWednesday, June 23, the analytics server may first identify all useractivity (e.g., files accessed, messages sent, workflow componentsrevised, and any other activity implemented on the user device orotherwise associated with the user even if performed from a differentuser device) within the inputted time.

The analytics server may then identify nodes within the nodal datastructure that correspond to the user's activity within the identifiedtime window. For instance, if a user has access to file A during theselected time, the analytics server may identify file A and may identifya corresponding cluster of linked nodes to which file A belongs. Inessence, the analytics server may identify a project that corresponds tothe identified nodes, which will likely identify a project associatedwith the inputted time. For instance, the analytics server may identify(using metadata associated with the related nodes) a project name thatall the related nodes have in common. As a result, the analytics servermay conclude that the related nodes correspond to a particular project.As a result, the analytics server concludes that the user's activityduring the identify time window is related to the identify project.

At step 1410, the analytics server may dynamically display theidentified linked clusters. The analytics server may display anindicator associated with the identified project (e.g., length clustersof nodes). FIG. 15A is a non-limiting example of the indicatorsdisplayed by the analytics server. As depicted in GUI 1500, a user mayselect various time periods. For instance, the user selects time windows1510, 1540, and 1550. The analytics server first identifies the user'sactivities during the identified time windows. The analytics server mayalso identify a cluster of linked nodes that include the identifiednodes accessed by the user within the time windows. For instance, theanalytics server displays indications of what files were accessed in thegraphical elements 1520A-C (corresponding to time windows 1510, 1540,and 1550 respectively). Using the methods/system described herein, theanalytics server identifies corresponding projects worked by the user inthe corresponding time window.

The analytics server may also display an indication of a suggestedproject via graphical elements 1530A/C. For instance, graphical element1530A indicates that (based on the user's activity in that time), theuser was working on project 1. Graphical indicator 1530B suggests thatthe user worked on project 2 from 10:30 AM to 1:00 PM.

The suggested projects and/or time windows may include visually distinctelements. The visually distinct elements (e.g., different colors) maycorrespond to a degree of certainty associated with the suggestedproject. For instance, if the analytics server determines that a timewindow is associated with a project with a high degree of certainty, theanalytics server may display the time window and/or the project nameusing green colors. Similarly, the analytics server may use red coloringfor the time window 1540 because the analytics server identifies project2 with a low degree of certainty.

To achieve the above-described visual distinction, the analytics servermay generate a likelihood that the identified cluster is associated witha particular time window. The likelihood may correspond to a number ofnodes accessed by the user within the time window that are alsoassociated with a particular project. For instance, if a user accesses10 files that are all associated with a project within a time window,the analytics server assigns a higher likelihood that the time windowshould be attributed to that particular project than another user whoaccesses one file that is associated with that particular project.

In some embodiments, the analytics server may identify the cluster ofnodes associated with a first user's work hours based on projects workedon by other users. For instance, as described above, the analyticsserver may generate an organizational chart for an organization. If theanalytics server identifies that multiple coworkers have attributedtheir work time (that corresponds to a particular node) to project A,the analytics server may suggest project A for all users who haveaccessed same, similar, and/or related nodes. In this way, the analyticsserver may predict a project based on a user's coworkers who are workingon the same project or have similar job descriptions andresponsibilities.

The analytics server may also provide users with the option of approvingand submitting the suggested projects. The user may, upon selecting thecorrect suggested project, approve and submit the timesheet. Ifapproved, the analytics server may use the user's approval to revise thenodal data structure accordingly. For instance, if the user approvesthat the content displayed in the graphical element 1520A are indeedrelated to project 1, the analytics server revises the nodal datastructure, such that nodes corresponding to the content displayed in thegraphical element 1520A are connected to nodes corresponding to project1.

In some configurations, the analytics server may use the revised nodaldata structure to display related information when displaying dataassociated with a project. As depicted in GUI 1502, when the analyticsserver displays information related to the project 1570, the analyticsserver may display tasks (and a corresponding time) performed by theuser.

In some configurations, the analytics server may allow a user todesignate a project when creating new tasks or other workflowcomponents. The analytics server may use the designated task to identifya project when prompted. As depicted in FIG. 15C (GUI 1504), a user maygenerate various tasks displayed in the graphical component 1592 and maydesignate the tasks to the project 1590. When prompted (e.g., when theuser accomplishes the tasks and requests the analytics server toidentify a corresponding project), the analytics server uses thisdesignation to suggest the project 1590.

In some configurations, the analytics server may use the methods/systemsdescribed herein to provide metrics associated with each project. Asdepicted in FIG. 15D (GUI 1506).

The analytics server may also use associated time entered byusers/employees to revise the nodal data structure. For instance, when auser submits his/her timesheets (even if the user is using third party,not generated by the analytics server, software or systems to log time,and so long as the analytics server can integrate with/ingest the data),the analytics server may revise the nodal data structure accordingly bybuilding relationships between the nodes corresponding to relatedactivity events, files, emails, and/or tasks and a given project/client.In an example, when a user identifies his/her worked time period asassociated with a project, the analytics server may identify a clusterof nodes associated with the user, which the user has interacted withduring that particular time period, and associate the nodes within theidentified cluster with one or more nodes of the project. That is, theanalytics server may revise the nodal data structure and the relativescores and labels between nodes in accordance with the projectassociations identified by users. The analytics server may also linknodes corresponding to different employees based on their identifiedprojects. For instance, the analytics server may link nodes associatedwith employees who are working on the same and/or related projects.

The methods/systems described herein can be used to generate userbehavior patterns. For instance, the analytics server maycontinuously/periodically monitor how a user accesses various files,tasks, and other workflow components. Consequently, the analytics servermay generate a model that represents the patterns of behavior associatedwith each user. The analytics server may use the generated behaviormodel to predict whether a user's account or device has beencompromised, as well as flag potentially dangerous intentional actionsby the user. The methods/systems described herein can be utilized togenerate news alerts, customized for users based on each user's uniqueworking patterns and relationships to other users.

The method depicted in FIGS. 16A-B uses the relationships describedherein to analyze a user's activity (working patterns) and to identifysuspicious behavior. For instance, when a user's working pattern hasdrastically changed or when a user is viewing data that does notcorrespond to his/her organizational chart (e.g., employee A isdownloading files associated with a project on which employee A has notworked), the analytics server may determine that the user's computer mayhave been compromised. In other embodiments, the analytics server canidentify employees that are about to quit by identifying user behaviorpatterns and comparing them to previously compromised users and/ordisgruntled employees. Some of the patterns may be easier for theanalytics server to identify. For instance, a user who starts blockingtime in the middle of the workday, forwards messages or downloads oldfiles that he/she has worked on may exhibit a pattern that is consistentwith an employee who is unhappy and/or prepare for his/her departure.Other examples can also include identifying emotional duress, stress,etc. which can be gleaned through sentiment analysis of consolidatedcomponents of work

FIG. 16A-B illustrate operational steps for identifying cybersecuritythreats by identifying relationships between different users andelectronic files. The analytics server can understand patterns aroundpeople as described in the FIG. 16B, and can also understand informationaround other understood contexts (e.g., a project) that areapplication/ecosystem agnostic as described above. For example, theanalytics server may be able to identify that access and usage patternsaround a team/project/other context are strange, where traditionalapplications might miss it due to either only seeing a piece of activityrelated to a team or project, and/or because it lacks an understandingof what these broader business contexts or objectives are. It may alsobe able to raise an alert around a given node (e.g., file) that isexperiencing unconventional/unauthorized activity across systems (e.g.,a file with sensitive information being sent via unsecure emailmessage). Furthermore, the analytics server can be used to consolidatehistorical information from a variety of different systems in order tomake it easy for individuals to audit or do e-discovery in the event ofa breach other problem. As before, the analytics server can deduplicatedata, pull in related context, and show components of work that arerelated to a higher lever business objective or concept (e.g., team,client, project, and deal).

At step 1610, the analytics server may periodically scan a plurality ofelectronic data repositories accessible to a plurality of computingdevices to identify a plurality of files stored onto the plurality ofelectronic data repositories where each file is accessible to at leastone computing device within the plurality of computing devices. At step1620, the analytics server may execute a predetermined protocol togenerate at least one unique identifier of each file within theplurality of files. At step 1630, the analytics server may retrieve, foreach file, context data comprising at least one of a time stamp, access,and edit history.

At step 1640, the analytics server may execute a computer model toidentify related files based on each file's context data and uniqueidentifier to generate a plurality of groups where each group comprisesat least one electronic file. At step 1650, the analytics server maymonitor electronic communication between a set of users to identify aset of electronic communications between at least two users, where eachelectronic communication is associated with at least one group of files.

At step 1660, the analytics server may identify context data associatedwith each identified electronic communication event, the context datacomprising at least a time stamp of each electronic communication eventand an electronic file and its corresponding group. At step 1670, theanalytics server may generate a behavior model based on each user'selectronic activity associated with at least one file, the electronicactivity of each user comprising at least a time stamp of each useraccessing at least one electronic file, revisions of each file, one ormore electronic files accessed, time duration of the user accessing eachfile. At step 1680, the analytics server may monitor electronic activityof each user within the set of users.

At step 1690, the analytics server may, in response to identifying thata monitored electronic activity of at least one user within the set ofusers does not match the behavior model generated by the server,transmitting, by the server, a notification to a computing device, thenotification comprising a message indicating the user, the monitoredelectric activity, a time stamp of the monitored electronic activity.

Using the methods/system described herein, the analytics server may helpusers better manage and prioritize the flow ofinformation/messages/notifications they receive at any given point intime by intelligently focusing on notifications related to a particularcontact or project. The methods and systems described herein mayautomatically rank the importance of notification related to differentcontacts and projects. Conventionally, activities (e.g., conversationsand notifications) revolving around a project are fragmented, spreadout, and repeated. As a result, response times to these notificationsmay be creating longer bottlenecks and a negative user experience. Theanalytics server may solve the above-described problem by providingsmart notifications (e.g., context aware notification system).

Smart notifications refocus many notifications and messages received byusers around contexts that are more relevant to users. In a non-limitingexample, a busy user, who is busy in between meetings all day, gets afew minutes to look through the messages he or she has accumulatedacross a variety of applications (e.g., notifications regarding emails,slack messages, and other notifications). Conventionally, he/she has togo through each application and scan for important notifications. Usingthe context-based notifications provided by the analytics server, he/shecould receive a consolidated list of messages, notifications, and actionitems organized based on various attributes (e.g., contacts, projects,clients, deals, teams, classes, or relevance).

As depicted in FIG. 17A, the analytics server can generate notificationsthat are optimized to fit the user's working mental models and are basedon different projects, clients, portfolios, teams, employees (e.g.,hierarchy of employees), and/or different reports. The analytics servermay group and prioritize a user's numerous notifications based on aprioritization attribute received from the user.

FIG. 17B illustrates operational steps for generating smartnotifications, in accordance with an embodiment. In step 1702, theanalytics server may generate a nodal data structure corresponding tofiles accessible and/or accessed by various users and user deviceswithin the network. As described throughout the present disclosure, theanalytics server may not limit the methods and systems described hereinto files. Therefore, the methods and systems described herein apply toany workflow component, such as tasks, messages, notifications, and anyother data generated and/or stored within the nodal data structure.

The generation of the nodal data structure is described in detailthroughout disclosure. The analytics server may then rank thenotifications based on the contextualization. For instance, theanalytics server may display the notifications based on howrelevant/important they are to the user. In this way, the user mayreceive the notification not based on a chronological order (e.g.,conventional systems) but based on how relevant and/or important thenotification is to the user.

Using the methods and systems described herein, the analytics server mayfirst contextualize notification and then rank the notifications basedon their respective context realizations. For instance, the analyticsserver may first identify how notification relates to a user (e.g., howimportant is the person who originated the notification, how importantis the project to which the notification relates, and/or how importantis the file to which the notification relates).

At step 1704, the analytics server may receive one or moreprioritization attributes from a user device. The analytics server willuse this attribute to prioritize various notifications associated withthe user and/or the user device. In some embodiments, the analyticsserver may display a prompt to receive the prioritization attribute fromthe user. The prioritization attribute may include a classification ofnotifications that must be prioritized by the analytics server.Non-limiting examples of prioritization attributes may include projects,clients, portfolio, team, team hierarchy, and the like.

Another prioritization attribute may correspond to prioritizingnotifications based on a relative importance of other users within thenetwork. In that embodiment, the analytics server may prioritizenotifications based on their corresponding user (e.g., users whooriginated the notification) and how important those users are relativeto the user receiving the notifications.

If no prioritization attribute is received, the analytic server mayidentify the user's working patterns (e.g., the user's hierarchy withinthe organization chart and projects associated with the user). Theanalytics server may use the user's working pattern as the defaultprioritization attribute. For instance, if a user has not identified adesire to prioritize his/her notifications by for example following orunfollowing certain contacts, projects, teams, etc., the analytic servermay prioritize the user's notification based on the projects in whichhe/she is involved and has recent activity over.

In step 1706, the analytics server may receive a set of notificationsassociated with the user device. The analytics server may receivevarious notifications that are associated with the user device. Thenotifications may correspond to notifications generated by varioussoftware tools and application. For instance, various applications andsoftware tools utilized by a user operating the user device mayautomatically generate the notifications. In another example, athird-party application may transmit a notification to the userinforming the user of an update and/or reporting an issue associatedwith a file and/or a project. In another embodiment, another user maytransmit a notification to the user device. For instance, a team membermay transmit a message associated with the user device. Notificationsmay also include received messages, updates to components of work,changes to contexts (e.g., new user added to a project), birthdayreminders, alerts, and the like.

In step 1708, the analytics server may generate a score for eachnotification. The analytics server may first parse the notification andidentify its attributes. For instance, the analytics server may firstidentify what service each notification originates from, who initiatedthe activity that resulted in a notification, and what component of work(i.e. file, task, and message) the notification relates to. Furtherdescribed in a non-limiting example, the analytics server may get thisinformation directly from the system/service where the activityhappened. In another embodiment, the analytics server could parse thenotification and identify the originating source (e.g., name of theoriginating person/server). The analytics server may then identify thecontent of each notification. For instance, the analytics server mayidentify a project, client, and/or a portfolio associated with eachnotification. Additionally or alternatively, the analytics server mayidentify a user associated with each notification. For instance, theanalytics server may execute a natural language processing algorithm toidentify whether the notification is associated with another user withinthe network. In some embodiments, the analytics server may identify theuser who has originated the notification. The analytics server may thengenerate each score accordingly.

To ingest various activity events and notifications, the analyticsserver may use different analytical protocols and methods. Theseprotocols and methods may differ based on the origination of thenotification. For instance, if the analytics server is ingesting thenotification from a browser, the analytics server may use a browsernotification flow. If the analytics server is ingesting the notificationfrom software as a service (Saas), the analytics server may use a Saasevent flow or a Saas polling flow.

For notifications received from a browser, the analytics server may usean extension code and inject a script to modify browser notificationAPI. The analytic server may use a modified API to capture a copy ofeach browser notification (e.g., notifications that are sent by an openwebpage to your browser). The analytics server may retrieve each copiedbrowser notification. The analytics server may then collect copiedbrowser notifications from multiple browser tabs and windows and frommultiple devices associated with the same and/or different users. Theanalytics server may then process collected browser notifications intoevents for the purposes of standardization, deduplication, the additionof metadata, and combining with events from other sources. The analyticsserver may then transmit the processed events/notifications ascontextualize and prioritize notifications to different client devices.These notifications may be sent to client sites offer applications thatmay or may not be generated and/or updated by the analytics server. Theclient-side software applications may receive the process events andnotifications and store them as local/temporary state (e.g., within alocal/temporary repository). The client sites offer application may thendisplay the notifications and events based on the metadata accompanyingeach notification and or event. The metadata described herein may referto the data produced by the analytics server that identifiescontextualization and prioritization of each notification or event.

For SaaS event flow, the analytics server may use per-product APIs andconfiguration consoles. For instance, the analytics server may subscribeto pushed events from various SaaS products and offerings (such as thirdparty data repositories and software tools). Using pushed technology,the analytics server may receive the pushed events and notifications.The analytics server may use the methods and systems described herein toprocess and analyze the received pushed events and notifications (e.g.,received events are processed for the purposes of standardization,deduplication, the addition of metadata, and combining with events andnotifications from other sources). The analytics server then combinesevents and notifications and displays them on a client device asdescribed above.

For SaaS polling flow, the analytics server may periodically pollvarious SaaS products and offerings for new and updated data, such asevents and/or notifications. The analytics server may then process thepolled data based on methods and systems described herein (e.g., eventdata is processed for the purposes of standardization, deduplication,the addition of metadata, and combining with events and notificationsfrom other sources). The analytics server then combines events andnotifications and displays them on a client device as described above.In some embodiments, the analytics server may use the SaaS polling flowmethodology and poll data from users within an entity (e.g., employees).

Event and notification metadata may consist of an identifier for theSaaS product, originating web page, specific account information(specifically for the case of multiple accounts with a single SaaSproduct), and/or other information. Deduplication occurs throughcomparison of event contents (e.g., identical messages) and eventmetadata (e.g., same or different accounts on one SaaS product).

Events may be stored at various points in the process for variousreasons and in various ways for such purposes of at-least-once delivery,delayed processing, load balancing, and analytics server client polling.

In general, the analytics server may retrieve various events andnotifications (e.g., non-blocking duplication, by the browser extension,and of browser notifications). The analytics server may also execute aprotocol to generate a unified and common format among all retrievednotifications. Because different notifications originate from differentsources (e.g., Saas, client devices, and internal/external softwaretools), they might be in different formats that may or may not becompatible with the analytical protocols used by the analytics server.As a result, the analytics server may first generate a common formatamong all events and notifications. The analytics server may then beduplicate the events using the methods and systems described herein. Theanalytics server may then hydrate the data by adding metadata indicativeof the context and priority data associated with each event andnotification. Finally, the analytics server may display thenotifications using the methods described herein.

Based on the retrieved attributes of each notification, the analyticsserver may generate a score for each notification. The analytics servermay use various predetermined scoring rules to calculate each scoreaccordingly. The scoring algorithm may correspond to the prioritizationattribute received in step 1704. For instance, the user has indicated adesire for the analytics server to prioritize the notifications based onteam hierarchy, the analytics server may generate the score based on anodal data structure, such as team hierarchy described in FIG. 13B. Inthat embodiment, the analytics server may generate a higher score forusers who are higher in the team hierarchy. As a result, a notificationthat was originated by and/or is associated with a person who isidentified as having a high score within the team hierarchy will have ahigher score than other notifications. That is, a notificationassociated with a user's manger will have a higher score than anotification associated with the user's peers.

In another example, if the user has indicated a desire for the analyticsserver to prioritize the notifications based on projects he/she isactively working on, the analytics server may calculate a higher scorefor a notification that is related to a particular project vs. otherprojects. For instance, if a notification includes the project name oris related to a component of work that is related to the project, thenotification may score higher than other notifications that do notinclude the project name. In some configurations, the analytics servermay identify projects related to a user (using the nodal datastructure). The analytics server may then generate higher scores fornotifications associated with the identified projects.

In step 1708, the analytics server may display the notification inaccordance with each notification's respective score. The analyticsserver may prioritize displaying (or otherwise outputting, such ashaptic, auditory, and otherwise). For instance, the analytics server maydisplay notifications with a higher score before notifications withrelatively lower scores. In another example, the notifications withhigher scores may be outputted differently. For instance, a prioritizednotification may be accompanied with a haptic or sound notification.

The analytics server may also contextualize each event and notificationusing the methods and systems described herein. The analytics server maythen augment data associated with each event and notification, such thatthe analytics server adds the contextualize data to each event andnotification. In a non-limiting example, the analytics server identifiesother users associated with each notification. Using the methods andsystems described herein, the analytics server may identify whether theassociated users are relatively important to the end user receiving thenotification. For example, if a notification is associated with a userwho is high within an organization chart, the analytics server maygenerate metadata indicating that the notification is possibly importantto the user receiving notification. The analytics server may use any ofthe methodologies described herein to add contextualization data to eachnotification. For example, the analytics server may identify a projectrelated to the notification I may add project specific data ascontextualize data to the notification.

When the analytics server receives a prioritization attribute from theuser receiving the notification, the analytics server may first retrieveall the contextualized data and rank the notifications based on theprioritization attribute.

A non-limiting example of prioritized notifications are illustrated inFIG. 17C, in accordance with an embodiment. Notification cluster 1726A-Billustrates an example where the analytics server prioritizes originatorusers (e.g., important colleagues). The notification cluster 1726Cillustrates an example where the analytics server prioritizesnotifications associated with a team over other notifications. Thenotification cluster 1726D illustrate an example where the analyticsserver prioritizes notifications associated with particular project.

Using the methods and systems described herein, the analytics server canoptimize notifications. For instance, as depicted in FIG. 17C, whileconventional notification systems organize notifications chronologically(e.g., GUI 1720) or based on applications (e.g., GUIS 1722 and 1724),the analytics server can organize notifications by context (e.g.,notification clusters 1726A-D).

In some embodiments, the analytics server may generate and prioritizethe notification. For instance, the analytics server may monitor one ormore users and their corresponding devices. When the analytics serveridentifies an update associated with a user, the analytics serveridentifies that a user has accessed, revised, and/or interacted with afile or any other workflow components, the analytics server may generatea notification accordingly (e.g., user X has revised file Y of projectZ). The analytics server may then prioritize the generated notificationsfor each user. For instance, all users associated with project Z mayreceive the above-described notification.

Most modern knowledge workers are involved with multiple projects, whichrequires them to access multiple files hosted on multiple (internal orexternal) applications and software tools. Currently, for simultaneousaccess to files (and other workflow components, such as tasks,notifications, and the like) related different projects, users mustaccess multiple software tools, which has created negative userexperiences and created inefficiencies. For instance, a typical employeemay have multiple browsers, software applications, task organizationapplications, and other project related software applications open atthe same time. Managing multiple applications decreases efficiencies.

Using the methods and systems described herein, the analytics server mayprovide a browser extension that provides all relevant data associatedwith a particular context. The context may represent any category ofdata selected by the user. For instance, the context may correspond to aproject, a particular aspect of a project, or any other category of datadescribed herein (e.g., all data associated with a particular employee).Using the methods and systems described herein, the analytics server maytraverse the nodal data structure, identify relevant nodes, analyzedidentified nodes, and provide context-specific data.

In a non-limiting example, a user may generate multiple browser tabswhere each tab corresponds to a particular project or any otherattribute of his/her workflow. When the user accesses a browser tab, theanalytics server only displays data associated with that particularproject. Therefore, the user may access all software tools, files,notifications, messages, and any other workflow component from theproject's browser tab. This eliminates the need to execute and initiatemultiple software applications. The browser tabs are customizable, suchthat users may customize notifications messages or any other workflowcomponents according to their needs. Various applications (internal orexternal) and software tools can be loaded onto each browser tab toprovide easy access. These applications and software tools could bedesktop applications, web applications, and/or other websites. Theanalytics server enables users to relate different accounts to eachapplication, such that a user can be simultaneously logged into multipleaccounts for the same type of application (e.g., two social mediaaccounts).

As will be described throughout, the methods and systems describedherein can be implemented within a browser extension. However, in otherembodiments, the methods and systems described herein can be implementedinto any software application.

The graphical user interfaces and browser extensions/tabs describedherein (e.g., FIGS. 18A-O) are also referred to herein at “smartwindows.” The disclosed platform that utilizes the methods and systemsdescribed herein offers software products that allow users to group datainto these contexts so that they can have a holistic view (and access)without being required to execute multiple browsers and applications.The disclosed platform improves focus and productivity, while helpingmanage content organization across different applications. Using themethods and systems described herein, the analytics server allows usersto load websites into their browser as native applications, and thengroup websites and resources into context.

The methods and systems described herein can be implemented as a browserextension that transforms browsers into collaborative and smartworkstations where users can access information from websites at thebrowser level (e.g., without loading the webpages), as well as load andoffload work sessions (and tabs) contextually. While some embodimentsdescribed herein are described in the context of a web-basedapplication, such as a web browser, the method 1800 can be implementedon any application (e.g., desktop applications, databases, and/orcloud-based applications).

The analytics server may also provide a keyboard navigable version ofthe same concepts, where users can use keyboard shortcuts and basiccommands to search through, access, and otherwise interact withinformation within the nodal data structure.

Because the analytics server identifies and aggregates relevant data,e.g., by generating the nodal data structure, users can organize theiractivities more efficiently. For instance, users can search data,generate messages and tasks, identify contacts, and receive alerts usingone workflow browser application (or a single browser tab).

FIG. 18A illustrates operational steps of displaying customize browsertabs based on identified projects associated with each user, inaccordance with an embodiment. At step 1802, the analytics server mayperiodically scan a plurality of electronic data repositories accessibleto a plurality of computing devices to identify a plurality of filesstored onto the plurality of electronic data repositories where eachfile is accessible to at least one computing device within the pluralityof computing devices. At step 1804, the analytics server may execute apredetermined protocol to generate at least one unique identifier ofeach file within the plurality of files (or any other workflowcomponent). At step 1806, the analytics server may retrieve, for eachfile, context data comprising at least one of a time stamp, access, andedit history. At step 1808, the analytics server may execute a computermodel to identify related files based on each file's context data andunique identifier to generate a plurality of projects where each projectcomprises at least one electronic file.

Using the method 1800, a user can create his/her own contexts through aGUI or API. The analytics server creates filters and/or models to mapnodes to a given context. As used herein, context may refer to anyattribute or classification designated by the user. For instance, thecontext may refer to a GUI specific to a project, team, client,portfolio, or any other classification desired by the user. Forinstance, the user can instruct the analytics server to generate thecustomize GUI that displays information relevant to one or moreattributes. The analytics server may use the attributes received twoidentified nodes that are associated with that attribute (e.g., nodesassociated with a project, client, employee subgroup, and/or aparticular department).

The context (the customized GUI) can be shared with others, so that theshared content of each customized (e.g., a shared project, team, andclient) does not have to be created multiple times by different people.Each user may then accordingly further customize the GUIs byadding/removing files, applications, and other content from eachcustomized GUI (context). New users can login and already have relevantapplications and context associated with their profile/workspace. Asinformation reaches analytics server from integrated systems, data isprocessed and may be associated with a given context. The analyticsserver may then dynamically update each context according to the newdata received. Therefore, each user may use his/her context as aone-stop shop to access his/her relevant data.

Additionally or alternatively, when a user opens a context via GUI or(or other user/system accesses Context via API), the analytics servercan perform a search based on certain attributes and return componentsof work as well as recommendations. Recommendations can come frombrowsing history, activity, notifications, saved tabs, pinned resources,etc. The context is an “app-like” experience because it is easy to open,easy to search, provides notifications control, andscreen-time/time-tracking, anther relevant features.

In steps 1802-1808, the analytics server may monitor user interactionsand may generate a nodal data structure corresponding to file accessibleand/or accessed by various users and user devices within the network.The generation of the nodal data structure is described in detailthroughout disclosure. The nodal data structure may correspond torelated files and/or workflow components within a network (e.g.,organization).

At step 1810, the analytics server may receive one or more attributesfrom a user device. The analytics server will use this attribute toidentify relevant nodes, group electronic content, and display the GUIsdescribed herein. In some embodiments, the analytics server may displaya prompt to receive the attribute from the user. The attribute mayinclude a classification of data or context selected by the user.Non-limiting examples of the attribute may include projects, clients,portfolio, team, team hierarchy, and the like. By selecting anattribute, the user instructs the analytics server to generate a GUI(e.g., browser tab) that displays data associated with that attribute.For instance, a user may instruct analytics server to generate a browsertab that displays data (e.g., files, notification, messages, and/ortasks) associated with a particular project, objective, or othercontext. The grouping of nodes from the nodal structure around anattribute and the graphical user interface that represents it is alsoreferred to herein as a context.

If no attribute is received, the analytic server may identify the user'sworking patterns (e.g., the user's hierarchy within the organizationchart and projects associated with the user). The analytics server mayuse the user's working pattern as the default attribute. For instance,if a user has not identified a desire to generate a context browser, theanalytic server may automatically consolidate and suggest workflowcomponents associated with different projects in which the users isinvolved.

Upon receiving the attribute, the analytics server may use the methodsand systems described herein to identify a cluster of nodes thatcorrespond to the attribute. For instance, if the attribute correspondsto a project name, the analytics server may identify all nodes withinthe nodal data structure that are related to that particular project.That is, the analytics server identifies a cluster of nodes that arerelated to the attribute. The analytics server may traverse the nodaldata structure and use the identified cluster of related nodes todisplay data as described herein.

At step 1811, the analytics server may dynamically generate a first linkdisplayed by the browser and configured to direct the browserapplication to one or more electronic files associated with identifiedcluster of related nodes when the user initiates a browser application.For clarity and brevity, the embodiments described herein depict abrowser tab that displays data grouped based on projects (sometimesreferred to herein as context). However, it is expressly understood thatdata within the nodal data structure can be grouped based on any otherattribute. For instance, the method 1800 may be used to generate abrowser tab that groups data for different clients, portfolios, and thelike.

Using the methods and systems described herein, the analytics server mayidentify files and workflow components related to different projects.The analytics server may search for and retrieve nodes that correspondto different files and workflow components that are relevant to theattribute received from the user. In other words, the analytics servermay use the nodal data structure to identify nodes that are associatedwith and relevant to the received attribute. The analytics server maythen generate various GUIs that display graphical elements thatcorrespond to files and other workflow components corresponding to theretrieve nodes.

The analytics server may generate customized browser tabs where each tabis specific to one or more projects associated with the user. When theuser interacts with a browser tab, the analytics server then displaysproject-specific information for the user including multiple links(e.g., graphical components) providing the user with the option toefficiently access data organized based on different projects to whichthey relate. In this way, users can minimize the number of browser tabsand active applications, which creates a better user experience.

Referring now to FIGS. 18B-P a non-limiting example of a browserextension displaying a browser tab that groups/clusters data based on anattribute is depicted, in accordance with an embodiment. The browserextension can be installed and implemented on any browser, such that theinformation can be displayed by the analytics server. FIG. 18Billustrates a typical web browser equipped with a browser extensionprovided by analytics server.

To start, a user may interact with the graphical indicator 1812 and theanalytics server displays the GUI 1824. In some configurations, this tab(graphical indicator 1812) will always be visible to the user. Thegraphical indicator 1812 is comparable to a computer's “desktop” forusers' browser. The graphical indicator 1812 may represent the homespace for cloud/web-based workflows, which the user can always access byinteracting with the graphical indicator 1812 and/or 1814. Byinteracting with these graphical indicators, a user can launch anyapplication he/she might want from “Home,” pin any application to thedock (graphical elements 1816) for easy access, search for anythingusing the search bar 1818, and view his/her recent working history overthe past several days (graphical elements 1820).

A user can also choose to keep certain contexts in their dock (graphicalelement 1822), such that the applications are easily accessible from the“Home” screen displayed in GUI 1824. The user can always access allpinned and unpinned applications and contexts from search bar 1818, suchthat she/he does not have to keep all of the contexts within hisworkspace always pinned to the Home screen. Also, similar toapplications, notifications, recently visited pages, integrated search,and more is easily accessible without having to open others applicationsor other software tool itself, (e.g., FIG. 18E). The analytics serveralso allows users to easily search and see units of work (notifications,files, messages, notes, etc.) relevant to that context withoutopening/loading any of the related webpages or applications. In thisway, the analytics server provides an “app-like experience” forcontextual workflows (e.g., objectives, clients, classes, teams,projects, etc.) where information and accessibility is greatly improvedin contrast with a traditional website or fragmented workflows acrossapplications.

When a user initiates a context, the user is able to continue workingfrom where the users last terminated his/her work (e.g., left off worklast time). This solution greatly improves an individual's workflowwhere she/he can log off or try to keep all the applications active onhis/her computer. The analytics server provides a way for the user toeasily maintain visibility and control into relevant activity acrossapplications/accounts by context (e.g., project and/or objectives),while maintaining a clean and organized workstation without anoverwhelming number of open windows. This improves both the user'sperformance as well as his computer's performance.

When a user chooses to continue working on a given context, theanalytics server can retrieve the user's last known state of work forthat given context, and allow the user continue working with the samewindow and tabs open as previously accessed. This can include loggingthe user into the corresponding application and accounts. For instance,if the user decided to include a third party email application, theanalytics server can integrate the third party application in the GUIsdescribed herein. The analytics server can also keep the user loggedinto the third party email account, such that the user is no longerrequired to initiate that third party application to check his/heremails. An alternate embodiment would allow multiple windows per contextand could include both online and offline work.

Additionally or alternatively, the analytics server may provide users tohave a context open as multiple windows in a “desktop,” which comprisescontextual files (files arranged based on their context data retrievedfrom the nodal data structure), notifications, messages, tasks,contacts, questions & answer forum, calendar, and the like.

FIG. 18C illustrates an embodiment where a user opens a context byeither clicking on that particular context it from the dock or searchingfor it. When opened, the analytics server searches for the user's or thecontext's previous state, retrieves all pertinent data, and loads in allthe necessary websites, tabs, logins, and components to match thatprevious state. Graphical indicator 1826 indicates to the user whatcontext is currently open (the project corresponding to the GUI depictedin FIG. 18C). The graphical indicator 1828 illustrates all the workflowcomponents (e.g., applications, task, and third-party items) to whichthe use has access (e.g., the user has open). In some embodiments, opentabs, illustrated in the graphical component 1828, maybe stated, suchthat the user can always access them. For instance, the analytics servermay display icons associated with the workflow components, such that theuser can view/access them as a browser tab.

When the user closes a workflow component, such as an application,changes tabs, visit the website, or performs any action, the analyticsserver saves/stores the state/status of the user, such that the user canterminate any or all parts of the graphical user interface shown here.In that way, when the user re-initiates the workflow component, theanalytics server can display a latest status of the workflow component.

The graphical components 1830B-M illustrate sections where a user canpin (e.g., permanently pin) resources to the GUIs described herein.Graphical user interfaces provided by the analytics server provide easymethods for a user to link key websites, documents, contacts, messages,applications, and other resources to a given context so that users caneasily access the content. The analytics server allows the users toaccess the pinned content, even after the user has terminated theapplication itself. For instance, graphical component 1830C illustratesthat a user has previously pinned a working document titled “native mileproduct forecasting.” Even if the underlying document is no longeractive (e.g., the user has terminated or closed the document or theapplication that hosts the document), the analytics server provides theuser the opportunity to have access the document using the graphicalcomponent 1830C. For instance, the user can re-initiate and view thisdocument by interacting with the document's icon.

The graphical component 1830B shows a given application and account thatis also pinned. The user can choose to show different things for a givenapplication. FIG. 15D (or 18D) illustrates alternate embodiments ofthese pinned sections where user is able to pin various workflowcomponent (e.g., tasks, files, and messages) to a browser provided bythe analytics server (e.g., a native application provided by theanalytics server). Alternate embodiments of these sections include beingable to pin sections showing live data embedded within components ofwork and widgets that show live data within applications and systems asillustrated in FIG. 15/8D.

Referring back to FIG. 18C, the user is able to have certain shortcutsassociated with that given application, or can change the view to seerecently viewed, starred components of work, activity, notifications forthat application, and the like. The graphical component 1832 illustrateshow the analytics server may organize various categories of files and/orworkflow components.

In some embodiments, the workflow components may include content that isshared with other projects or context. For instance, the graphicalcomponent 1830A/E illustrates pinned resources that may correspond tomore than one contexts. In essence, the content pinned and illustratedin the graphical component 1830A/E represent links shared between twodifferent contexts. This may be similar to the links between files orother nodes within the neural data structure. Similarly, the analyticsserver can recommend these associations. However, links between contextsmay be different in that they can be shared metadata tags for resourcesor components of work relevant to both sections. Therefore when linkingtwo contexts together, users may create a shared section both contexts,which essentially represents an intersection between the contexts. Inthe depicted example, the user has a context for “Contracts” (1830A) anda context for “Research” (1830E). In an embodiment, interacting with the“contracts” context would cause the analytics server to show a number ofother sections for a variety of clients, one of which is “Client12-Superhoops”. Similarly when looking at each client's context, theanalytics server would show a contracts section in each.

The graphical user interfaces described herein help users view pertinentdata associated with each project. The users can simultaneously workacross multiple applications, platforms, and the like. The graphicaluser interfaces described herein provide sharing features to onboardcollaborators to different projects, across all different applicationsand accounts. The analytics server can also augment users' web browsingwith useful data related to various contexts using the nodal datastructure described herein.

As depicted in FIG. 18E, the analytics server may display variousnotifications and messages specific to a project/context. For instance,the graphical component 1834 illustrates that the analytics serverindexes all files, messages, tasks, notifications, starred items, andthe like for every integrated application and/or context. The analyticsserver may index workflow components and organize all relevant workflowcomponents as they relate to a specific application, contacts, project,or any other category selected by the user (e.g., graphical component1836).

As depicted in FIG. 18F, the analytics server allows the user totransmit and share any workflow component (e.g., file, context, and thelike) with other users. The analytics server provides the user an optionto select a workflow component and further select a user where theanalytics server shares the selected content with the recipient (e.g.,graphical component 1838). The analytics server may also allow the user(sender) to select editing permissions for the recipient of the content.

As depicted in FIG. 18G, the analytics server provides a searchfunctionality where the searching is limited by a category, context, orany other attribute selected by the user (e.g., attribute received fromthe user). For instance, a user may select a project (as depicted,project “client 12-D2C shirts”). The user may then use the search barprovided by the analytics server and input a query. Upon receiving atext string entered by the user, the analytics server may identify,within the nodal structure, relevant data that correspond to the queryinputted by the user and the project selected by the user. The analyticsserver may display the results in the graphical component 1842. Asdepicted, the analytics server may also populate the graphical element1840 where notifications, starred workflow components, content,messages, contacts, and shared content associated with the inputtedquery and the selected project are displayed. The user may access allthe information mapped and filter the information using categoriesdescribed in the graphical element 1840. Using the methods and systemsdescribed herein, the analytics server may search for and displaycontent related to local browsing history; open tabs and last used stateof context; websites, applications, and resources pinned to context;sharing settings; and units of work mapped to a particularproject/context.

FIGS. 18H-M illustrate how the analytics server can display componentsof work like notifications, messages, files, tasks, and contactsassociated with a particular context. The graphical user interfacesdescribed herein provide an app-like experience where data is arrangedbased on respective context (e.g., arranged based on variousattributes), easy to access, and easy to search. In some embodiments,the analytics server may monitor user's interactions with variousfeatures displayed on these graphical user interfaces. For instance, theanalytics server may track how long a user interacts with file accessedthrough a context (customized GUI). The analytics server may then usethe track time to attribute the users worked hours to a particularproject as described herein.

For instance, a user can search for data associated with the “Client12-Superhoops” project. The analytics server may first identify acluster of nodes within the nodal data structures described herein. Theanalytics server may then conduct a search for the inputted query withinthe identified cluster of nodes.

By limiting the search to the cluster of relevant nodes, the analyticsserver may increase efficiencies and provide better and more relevantresults (FIGS. 18H-M). As depicted in FIG. 18H, the analytics server maydisplay all relevant notifications. As depicted in FIG. 18I, theanalytics server may display all relevant messages. The retrievedmessages may belong to different (internal or external) applications andmessaging protocols. However, all the retrieve messages are relevantmessages to the selected project. As shown in the right-hand side ofFIG. 18I, the reasons for why certain messages are displayed isdescribed, including but not limited to, messages accessed in thatcontext, comments on files related to that context, suggestions based onother user's activity, etc. As depicted in FIG. 18J, the analyticsserver may display all relevant files. An alternate embodiment of therelevant files is shown in FIG. 18D. Finally, as depicted in FIG. 18K,the analytics server may display all relevant tasks, which may or maynot belong to the same application. The analytics server may alsoprovide the users with the option of sorting and/or filtering theretrieve content based on various attributes selected by the user.

In some embodiments, graphical user interfaces or at least parts of thedescribed GUI features may be integrated into third-party application.For instance, features described herein can be integrated into a thirdparty (or sometimes internal) messaging application as depicted in FIGS.18L/M. For instance, one or more features described herein can bedisplayed as a part of the depicted messaging application (e.g.,graphical element 1844). Moreover, the analytics server may use themethods and systems described herein to sort/filter messages based onvarious attributes received from the user. For instance, messagesdisplayed in the graphical element 1846 belong to different messagingapplications. However, the analytics server displays the messages, suchthat they are sorted based on context (e.g., project) not time or thesender.

As depicted in FIG. 29, using the methods and systems described herein,a user may initiate various applications and accounts independently, ormanage them in different windows (e.g., browser tabs) grouped by variousattributes, such as projects, clients, portfolio, or any other context.

In some configurations, a user may generate multiple “smart windows” andcustomize graphical user interfaces where each customize graphical userinterface corresponds to an attribute. Referring now to FIG. 19-21, asdescribed in GUI 1900, a user may generate different dashboards thatcorrespond to different contexts. For instance, browser tab 1902 and1904 are applications opened by the user that may correspond to “AwesomeProject”. The user may use the graphical element 1906 to create newcontexts or smart window, such that each new tab within the window showsa contextual dashboard in the new tab page. This contextual dashboardcan show all relevant components of work as described above and depictedin FIGS. 18A-M and 19-21. Contexts can be embedded within one anothersuch that one context can be experienced by the user as hierarchicallynested within a different context. Similarly, users can pin and relateapplications, accounts, systems, resources, and components of work witha given context. For instance, “Project 234” may be a project in Trello®that is a part of “Awesome Project.” The user is easily able to openProject 234 in its native Trello® interface, as well as view therelevant tasks without opening Trello® by looking at the dashboards.

Using the methods and systems described herein, the analytics server maydisplay various workflow components (e.g., files, messages, tasks, andother related content) within each smart window. The user may dictatehow various data is displayed within each smart window. For instance, asdepicted in FIG. 20, the analytics server may sort the contentchronologically. In another example, as depicted in FIG. 21, theanalytics server may use multiple attributes to sort the content (e.g.,sort by due date, creator, and assignee).

The foregoing method descriptions and the process flow diagrams areprovided merely as illustrative examples and are not intended to requireor imply that the steps of the various embodiments must be performed inthe order presented. The steps in the foregoing embodiments may beperformed in any order. Words such as “then,” “next,” etc. are notintended to limit the order of the steps; these words are simply used toguide the reader through the description of the methods. Althoughprocess flow diagrams may describe the operations as a sequentialprocess, many of the operations can be performed in parallel orconcurrently. In addition, the order of the operations may bere-arranged. A process may correspond to a method, a function, aprocedure, a subroutine, a subprogram, and the like. When a processcorresponds to a function, the process termination may correspond to areturn of the function to a calling function or a main function.

The various illustrative logical blocks, modules, circuits, andalgorithm steps described in connection with the embodiments disclosedherein may be implemented as electronic hardware, computer software, orcombinations of both. To clearly illustrate this interchangeability ofhardware and software, various illustrative components, blocks, modules,circuits, and steps have been described above generally in terms oftheir functionality. Whether such functionality is implemented ashardware or software depends upon the particular application and designconstraints imposed on the overall system. Skilled artisans mayimplement the described functionality in varying ways for eachparticular application, but such implementation decisions should not beinterpreted as causing a departure from the scope of this disclosure orthe claims.

Embodiments implemented in computer software may be implemented insoftware, firmware, middleware, microcode, hardware descriptionlanguages, or any combination thereof. A code segment ormachine-executable instructions may represent a procedure, a function, asubprogram, a program, a routine, a subroutine, a module, a softwarepackage, a class, or any combination of instructions, data structures,or program statements. A code segment may be coupled to another codesegment or a hardware circuit by passing and/or receiving information,data, arguments, parameters, or memory contents. Information, arguments,parameters, data, etc. may be passed, forwarded, or transmitted via anysuitable means including memory sharing, message passing, token passing,network transmission, etc.

The actual software code or specialized control hardware used toimplement these systems and methods is not limiting of the claimedfeatures or this disclosure. Thus, the operation and behavior of thesystems and methods were described without reference to the specificsoftware code being understood that software and control hardware can bedesigned to implement the systems and methods based on the descriptionherein.

When implemented in software, the functions may be stored as one or moreinstructions or code on a non-transitory computer-readable orprocessor-readable storage medium. The steps of a method or algorithmdisclosed herein may be embodied in a processor-executable softwaremodule, which may reside on a computer-readable or processor-readablestorage medium. A non-transitory computer-readable or processor-readablemedia includes both computer storage media and tangible storage mediathat facilitate transfer of a computer program from one place toanother. A non-transitory processor-readable storage media may be anyavailable media that may be accessed by a computer. By way of example,and not limitation, such non-transitory processor-readable media maycomprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage,magnetic disk storage or other magnetic storage devices, or any othertangible storage medium that may be used to store desired program codein the form of instructions or data structures and that may be accessedby a computer or processor. Disk and disc, as used herein, includecompact disc (CD), laser disc, optical disc, digital versatile disc(DVD), floppy disk, and Blu-ray disc where disks usually reproduce datamagnetically, while discs reproduce data optically with lasers.Combinations of the above should also be included within the scope ofcomputer-readable media. Additionally, the operations of a method oralgorithm may reside as one or any combination or set of codes and/orinstructions on a non-transitory processor-readable medium and/orcomputer-readable medium, which may be incorporated into a computerprogram product.

The preceding description of the disclosed embodiments is provided toenable any person skilled in the art to make or use the embodimentsdescribed herein and variations thereof. Various modifications to theseembodiments will be readily apparent to those skilled in the art, andthe principles defined herein may be applied to other embodimentswithout departing from the spirit or scope of the subject matterdisclosed herein. Thus, the present disclosure is not intended to belimited to the embodiments shown herein but is to be accorded the widestscope consistent with the following claims and the principles and novelfeatures disclosed herein.

While various aspects and embodiments have been disclosed, other aspectsand embodiments are contemplated. The various aspects and embodimentsdisclosed are for purposes of illustration and are not intended to belimiting, with the true scope and spirit being indicated by thefollowing claims.

What is claimed is:
 1. A method comprising: periodically scanning, by aserver, a plurality of electronic data repositories accessible to aplurality of computing devices to identify a plurality of files storedonto the plurality of electronic data repositories where each file isaccessible to at least one computing device within the plurality ofcomputing devices; executing, by the server, a protocol to generate atleast one unique identifier of each file within the plurality of files;retrieving, by the server for each file, context data comprising atleast one of a time stamp and access history; executing, by the server,a computer model to identify related files based on each file's contextdata or unique identifier; generating, by the server, a nodal datastructure comprising a set of nodes where each node within at least asubset of the set of nodes corresponds to a file within the set offiles, each node within the set of nodes linked to at least one othernode; receiving, by the server, an attribute from a user; identifying,by the server within the nodal data structure, a cluster of relatednodes associated with the received attribute; and when the userinitiates a browser application, dynamically generating, by the server,a first link displayed by the browser application and configured todirect the browser application to one or more electronic filesassociated with identified cluster of related nodes.
 2. The method ofclaim 1, further comprising: generating, by the server, a second linkdisplayed by the browser application and configured to direct thebrowser application to one or more electronic communications associatedwith the identified cluster of related nodes.
 3. The method of claim 1,further comprising: generating, by the server, a third link displayed bythe browser application and configured to direct the browser applicationto third party applications associated with the identified cluster ofrelated nodes.
 4. The method of claim 1, further comprising: displaying,by the server, one or more time periods of worked hours associated withthe identified cluster of related nodes.
 5. The method of claim 1,wherein the attribute corresponds to a project.
 6. The method of claim5, wherein the server automatically identifies the attribute as aproject associated with the user.
 7. The method of claim 1, furthercomprising: generating, by the server, a search bar displayed by thebrowser application and configured to perform queries associated withthe identified cluster of related nodes.
 8. The method of claim 1,further comprising: displaying, by the server, one or more tasksassociated with the identified cluster of related nodes.
 9. The methodof claim 1, wherein the browser application is a third partyapplication.
 10. The method of claim 1, wherein the server sorts the oneor more electronic files.
 11. A system comprising: a server incommunication with at least one electronic data repository and aplurality of electronic devices each configured to access at least onefile stored onto the at least one electronic data repository, the servercomprising a processor and a non-transitory computer-readable mediumcontaining instructions that when executed by the processor causes theprocessor to perform operations comprising: periodically scan aplurality of electronic data repositories accessible to a plurality ofcomputing devices to identify a plurality of files stored onto theplurality of electronic data repositories where each file is accessibleto at least one computing device within the plurality of computingdevices; execute a protocol to generate at least one unique identifierof each file within the plurality of files; retrieve, for each file,context data comprising at least one of a time stamp and access history;execute a computer model to identify related files based on each file'scontext data or unique identifier; generate a nodal data structurecomprising a set of nodes where each node within at least a subset ofthe set of nodes corresponds to a file within the set of files, eachnode within the set of nodes linked to at least one other node; receivean attribute from a user; identify, within the nodal data structure, acluster of related nodes associated with the received attribute; andwhen the user initiates a browser application, dynamically generate afirst link displayed by the browser application and configured to directthe browser application to one or more electronic files associated withidentified cluster of related nodes.
 12. The system of claim 1, whereinthe server is further configured to: generate a second link displayed bythe browser application and configured to direct the browser applicationto one or more electronic communications associated with the identifiedcluster of related nodes.
 13. The method of claim 1, wherein the serveris further configured to: generate a third link displayed by the browserapplication and configured to direct the browser application to thirdparty applications associated with the identified cluster of relatednodes.
 14. The method of claim 1, wherein the server is furtherconfigured to: display one or more time periods of worked hoursassociated with the identified cluster of related nodes.
 15. The methodof claim 1, wherein the attribute corresponds to a project.
 16. Themethod of claim 5, wherein the server automatically identifies theattribute as a project associated with the user.
 17. The method of claim1, wherein the server is further configured to: generate a search bardisplayed by the browser application and configured to perform queriesassociated with the identified cluster of related nodes.
 18. The methodof claim 1, wherein the server is further configured to: display one ormore tasks associated with the identified cluster of related nodes. 19.The method of claim 1, wherein the browser application is a third partyapplication.
 20. The method of claim 1, wherein the server sorts the oneor more electronic files.